ggarrange()函数会使我的箱形图颜色变深

时间:2020-08-19 18:05:05

标签: r ggplot2

我正在制作两个箱形图,并希望将它们彼此并排放置。当分别显示它们时,我使它们看起来都像我想要的,但是当我使用 ggarrange()时,颜色消失了。这是我的情节代码:

BOX1_data <- read.table(file = "clipboard", 
                      sep = "\t", header=TRUE)
BOX1_data$Diagnosis <- as.factor(BOX1_data$Diagnosis)
BOX1plot <- ggplot(BOX1_data, aes(x=Diagnosis, y=No.Variants, fill= Diagnosis)) + geom_boxplot() + 
    scale_fill_brewer(palette = "Dark2") +
    scale_x_discrete(labels = c("AC\nN=38", "SqCC\nN=15", "SCLC\nN=8", "BL disease\nN=16"))

BOX2_data <- read.table(file = "clipboard", 
                     sep = "\t", header=TRUE)
BOX2_data$Stage <- as.factor(BOX2_data$Stage)
BOX2plot <- ggplot(BOX2_data, aes(x=Stage, y=No.Variants, fill = Stage))    + geom_boxplot(width = 0.4) + 
    scale_fill_brewer(palette = "Dark2") + 
    scale_x_discrete(labels = c("Stage I-III\nN=24", "Stage IV\nN=37"))

然后安排我写的情节:

BOX_list <- list(BOX1plot, BOX2plot)
ggarrange(plotlist = BOX_list, labels = c('A', 'B'), ncol = 2)

我认为摆脱网格线等最简单的方法是使用theme_set(),我认为这可能是我的问题。 我的代码是:

theme_set(theme_bw() + theme(panel.border = element_blank(), panel.grid.major = element_blank(),
                    panel.grid.minor = element_blank(), panel.background = element_blank(), 
                    axis.line = element_line(colour = "grey")))

我意识到 theme_bw()会覆盖框中的颜色。但是我尝试删除它,将其切换为 theme_transparent()(这会删除我的所有标签),但均无效。我一直在寻找一种方法,可以仅在主题的框中添加透明度,以使我的颜色发光。我也很怀疑,也许我选择的调色板可能在我也不想要的两个图中给我相同的颜色。另外,如果重要的话,我在第一个情节中有4个小组,在第二个情节中有2个小组。

dput(BOX1_data)
structure(list(Diagnosis = structure(c(1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 3L, 
3L, 3L, 3L, 3L, 3L, 3L, 3L, 4L, 4L, 4L, 4L, 4L, 4L, 4L, 4L, 4L, 
4L, 4L, 4L, 4L, 4L, 4L, 4L), .Label = c("1", "2", "3", "4"), class = "factor"), 
    No.Variants = c(3L, 4L, 6L, 14L, 3L, 3L, 4L, 3L, 3L, 3L, 
    8L, 6L, 22L, 10L, 6L, 9L, 1L, 9L, 3L, 4L, 8L, 2L, 13L, 3L, 
    11L, 19L, 5L, 5L, 3L, 12L, 4L, 2L, 4L, 18L, 8L, 7L, 7L, 12L, 
    4L, 1L, 6L, 3L, 2L, 8L, 10L, 3L, 15L, 9L, 13L, 13L, 15L, 
    10L, 10L, 12L, 6L, 3L, 12L, 9L, 15L, 10L, 18L, 3L, 6L, 3L, 
    6L, 1L, 3L, 3L, 7L, 1L, 2L, 10L, 7L, 7L, 1L, 0L, 2L)), row.names = c(NA, 
-77L), class = "data.frame")
dput(BOX2_data)
structure(list(No.Variants = c(3L, 4L, 6L, 14L, 3L, 3L, 4L, 3L, 
3L, 3L, 8L, 6L, 22L, 10L, 6L, 9L, 1L, 9L, 3L, 4L, 8L, 2L, 13L, 
3L, 11L, 19L, 5L, 5L, 3L, 12L, 4L, 2L, 4L, 18L, 8L, 7L, 7L, 12L, 
4L, 1L, 6L, 3L, 2L, 8L, 10L, 3L, 15L, 9L, 13L, 13L, 15L, 10L, 
10L, 12L, 6L, 3L, 12L, 9L, 15L, 10L, 18L), Stage = structure(c(1L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 
2L, 2L, 2L, 2L, 1L, 1L, 2L, 2L, 2L, 2L, 2L, 2L), .Label = c("1", 
"2"), class = "factor")), row.names = c(NA, -61L), class = "data.frame")

感谢任何提示!

2 个答案:

答案 0 :(得分:0)

如果您对ggarrange()有疑问,我建议使用patchwork的下一种方法:

library(ggplot2)
library(patchwork)

#Data format
BOX1_data$Diagnosis <- as.factor(BOX1_data$Diagnosis)
#Plot 1
BOX1plot <- ggplot(BOX1_data, aes(x=Diagnosis, y=No.Variants, fill= Diagnosis)) + geom_boxplot() + 
  scale_fill_brewer(palette = "Dark2") +
  scale_x_discrete(labels = c("AC\nN=38", "SqCC\nN=15", "SCLC\nN=8", "BL disease\nN=16"))
#Data format
BOX2_data$Stage <- as.factor(BOX2_data$Stage)
#Plot 2
BOX2plot <- ggplot(BOX2_data, aes(x=Stage, y=No.Variants, fill = Stage))    + geom_boxplot(width = 0.4) + 
  scale_fill_brewer(palette = "Dark2") + 
  scale_x_discrete(labels = c("Stage I-III\nN=24", "Stage IV\nN=37"))

#Arrange plots
BOX1plot+BOX2plot+plot_annotation(tag_levels = 'A')

输出:

enter image description here

答案 1 :(得分:0)

正如已经指出的,看来OP的theme_set()删除了两个绘图中设置的填充色的问题已通过更新为ggplot2的新版本来解决。在此,我对OP的第二部分有一个解决方案(在注释中已阐明)。为方便起见,在此表示:

现在,这只是我希望调色板在第二个绘图的框上继续而不是重新启动的问题,以便我在所有框上获得不同的颜色。

为此,必须认识到第一个图BOX1plot有4种填充颜色,BOX2plot有2种填充颜色。对于BOX1plot,我们希望调色板从第一种颜色开始,而对于BOX2plot,我们希望调色板从调色板中的第5个颜色序列开始。无法通过scale_*_brewer()函数执行此操作,因此此处的方法将是从RcolorBrewer::brewer.pal()访问Brewer面板,然后根据级别数指定从何处开始和结束的顺序使用scale_fill_manual()来设置每个因子的大小,只需从提取的Brewer调色板中设置颜色值即可。

您只需“知道” BOX1plot就需要“使用颜色1-4”,BOX2plot就需要“使用颜色5和6”;但是,仅根据级别数自动计算该值(如果要再次运行此值)会更优雅。下面的代码执行此操作:

library(ggplot2)
library(ggpubr)
library(RColorBrewer)

# ... read in your data as before

# create factors (as OP did before)
BOX1_data$Diagnosis <- as.factor(BOX1_data$Diagnosis)
BOX2_data$Stage <- as.factor(BOX2_data$Stage)

# make color palette based on Brewer "Dark2" palette
lev_diag <- length(levels(BOX1_data$Diagnosis))
lev_stage <- length(levels(BOX2_data$Stage))
lev_total <- lev_diag + lev_stage
my_colors <- brewer.pal(lev_total, "Dark2")

BOX1plot <- ggplot(BOX1_data, aes(x=Diagnosis, y=No.Variants, fill= Diagnosis)) + geom_boxplot() + 
  scale_fill_manual(values=my_colors[1:lev_diag]) +
  scale_x_discrete(labels = c("AC\nN=38", "SqCC\nN=15", "SCLC\nN=8", "BL disease\nN=16"))

BOX2plot <- ggplot(BOX2_data, aes(x=Stage, y=No.Variants, fill = Stage))    + geom_boxplot(width = 0.4) + 
  scale_fill_manual(values = my_colors[(lev_diag+1):lev_total]) +
  scale_x_discrete(labels = c("Stage I-III\nN=24", "Stage IV\nN=37"))

BOX_list <- list(BOX1plot, BOX2plot)
ggarrange(plotlist = BOX_list, labels = c('A', 'B'), ncol = 2)

enter image description here