r - 如何正确添加标签到ggplot2水平箱图

时间:2017-12-01 15:43:50

标签: r ggplot2 label boxplot

编辑提供可重复的例子。

示例数据:https://owncloud.cesnet.cz/index.php/s/oopPE2Ut4quSVOK

拥有此数据

> head(df)
  Sample_Name        E Sentrix_ID Sentrix_ID_full
1    P129C1S1 5.636927 5058818037    5058818037_A
2    P129C1S1 5.794948 5058818037    5058818037_A
3    P129C1S1 5.608488 5058818037    5058818037_A
4    P129C1S1 5.989108 5058818037    5058818037_A
5    P129C1S1 5.570090 5058818037    5058818037_A
6    P129C1S1 5.555401 5058818037    5058818037_A

我正在通过

生成水平箱图
library(ggplot2)
library(ggthemes)

df <- read.csv("sample.csv")
df$Sentrix_ID <- as.factor(df$Sentrix_ID)
df$Sentrix_ID_full <- as.factor(df$Sentrix_ID_full)

head(df)

p <- ggplot(data = df,
            aes(y = E, x = Sentrix_ID_full, color = Sentrix_ID, label = Sample_Name)) +
  geom_boxplot(outlier.shape = NA) +
  scale_colour_manual(values = c("5058818037" = "red", "5226121006" = "green")) +
  theme_few() +
  guides(colour = FALSE) +
  labs(x = "Sentrix ID", y = "E", title = "intensity values") +
  coord_flip()

p

我想为每个箱图添加标签:

p <- p + geom_text()

但这种情况发生了:

有什么解决方案吗?提前谢谢你:)

1 个答案:

答案 0 :(得分:4)

一个建议:使用第二个数据框只标记每个箱图一次并在指定的位置:

# mock data
set.seed(1)
df <- data.frame(var_1 = c(rep("A", 15), rep("B", 15), rep("C", 15)),
             value = sample(100, 45, replace= T))

# the kind of plot you have
ggplot(df, aes(x = var_1, y = value, label = var_1)) +
  geom_boxplot() +
  coord_flip() +
  geom_text()

plot_1

# Use of plyr package
library(plyr)
df2 <- ddply(df, "var_1", summarize, max_value = max(value, na.rm = T))
# I chose max_value as position for the label

# the new plot
ggplot(df, aes(x = var_1, y = value)) +
  geom_boxplot() +
  coord_flip() +
  geom_text(data = df2, aes(x = var_1, y = max_value, label = var_1), hjust = -0.5) # use hjust to adjust horizontal position 

enter image description here