How to center labels in a stacked percent bar chart

Asked: 2016-01-20 14:57:35

Tags: r ggplot2 bar-chart labels

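I am trying to plot a nice stacked percent barchart using ggplot2. I have read some material and can almost manage to plot what I want. I also list that material here, since it may be useful to have it in one place: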

How do I label a stacked bar chart in ggplot2 without creating a summary data frame?

Create stacked barplot where each stack is scaled to sum to 100%

R stacked percentage bar plot with percentage of binary factor and labels (with ggplot)

My problem is that I cannot place the labels where I want them: in the middle of the bars. [image]

As the image above shows, the labels look terrible and also overlap each other.

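What I am looking for right now is:

  1. How to place the labels in the middle of the bars (areas)

  2. How to plot not all of the labels, but only, for example, those above 10%?

  3. How to solve the overlapping problem?

For Q 1, @MikeWise suggested a possible solution. However, I still cannot solve this problem.

Also, I attach a reproducible example of how I plotted this graph.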

    library('plyr')
    library('ggplot2')
    library('scales')
    set.seed(1992)
    n=68
    
    Category <- sample(c("Black", "Red", "Blue", "Cyna", "Purple"), n, replace = TRUE, prob = NULL)
    Brand <- sample("Brand", n, replace = TRUE, prob = NULL)
    Brand <- paste0(Brand, sample(1:5, n, replace = TRUE, prob = NULL))
    USD <- abs(rnorm(n))*100
    
    df <- data.frame(Category, Brand, USD)
    
    # Calculate the percentages
    df = ddply(df, .(Brand), transform, percent = USD/sum(USD) * 100)
    
    
    # Format the labels and calculate their positions
    df = ddply(df, .(Brand), transform, pos = (cumsum(USD) - 0.5 * USD))
    
    # Create nice labels
    df$label = paste0(sprintf("%.0f", df$percent), "%")  
    
    
    
    ggplot(df, aes(x=reorder(Brand,USD,
                                  function(x)+sum(x)),  y=percent, fill=Category))+
      geom_bar(position = "fill", stat='identity',  width = .7)+
      geom_text(aes(label=label, ymax=100, ymin=0), vjust=0, hjust=0,color = "white",  position=position_fill())+
      coord_flip()+
      scale_y_continuous(labels = percent_format())+
      ylab("")+
      xlab("")
    

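2 Answers:

Answer 0 (score: 30)

Here is how to center the labels and avoid plotting labels for small percentages. An additional issue in your data is that there are multiple bar sections for each colour. It seems to me that all the bar sections of a given colour should be combined instead. The code below uses dplyr instead of plyr to set up the data for plotting: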

library(dplyr)

# Initial data frame   
df <- data.frame(Category, Brand, USD)

# Calculate percentages and label positions
df.summary = df %>% group_by(Brand, Category) %>% 
  summarise(USD = sum(USD)) %>%   # Within each Brand, sum all values in each Category
  mutate(percent = USD/sum(USD),
         pos = cumsum(percent) - 0.5*percent)
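
To see why `cumsum(percent) - 0.5*percent` lands in the middle of each segment, here is a minimal sketch in base R for a single brand. The values in `usd` are made up for illustration and are not taken from the question's data; `upper` and `lower` are helper names introduced only for this check.

```r
# Hypothetical amounts for one brand (not from the question's data)
usd <- c(Black = 20, Blue = 30, Red = 50)

percent <- usd / sum(usd)      # share of each category within the brand
upper   <- cumsum(percent)     # far edge of each stacked segment
lower   <- upper - percent     # near edge of each stacked segment

# Label position as computed in the answer above
pos <- cumsum(percent) - 0.5 * percent

# Each position is halfway between the two edges of its segment
stopifnot(isTRUE(all.equal(pos, (lower + upper) / 2)))
print(pos)
```

One caveat, as far as I know: the midpoints only match the bars if the rows are accumulated in the same order in which ggplot2 stacks the segments, so it is worth checking the plot rather than trusting the arithmetic alone.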

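To plot the data, use an ifelse statement to decide whether a label is plotted. In this case, I have avoided plotting a label for percentages below 7%.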

ggplot(df.summary, aes(x=reorder(Brand,USD,function(x)+sum(x)), y=percent, fill=Category)) +
  geom_bar(stat='identity',  width = .7, colour="black", lwd=0.1) +
  geom_text(aes(label=ifelse(percent >= 0.07, paste0(sprintf("%.0f", percent*100),"%"),""),
                y=pos), colour="white") +
  coord_flip() +
  scale_y_continuous(labels = percent_format()) +
  labs(y="", x="")
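
The label logic inside `aes()` can also be pulled out into a small function, which makes the cutoff easy to change (the question asked about 10% rather than 7%). This is only a sketch: `label_if_large` and `min_share` are hypothetical names, not part of ggplot2.

```r
# Hypothetical helper: format a share (0 to 1) as a percentage label, or
# return an empty string when the share is below `min_share`
label_if_large <- function(share, min_share = 0.10) {
  ifelse(share >= min_share,
         paste0(sprintf("%.0f", share * 100), "%"),
         "")
}

# Shares below the cutoff get an empty label
label_if_large(c(0.04, 0.10, 0.256))
# → "" "10%" "26%"
```

Inside the plot this would be used as `geom_text(aes(label = label_if_large(percent)), ...)`.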


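UPDATE: With ggplot2 version 2 (the `vjust` argument of `position_stack()` was added in 2.2.0), it is no longer necessary to calculate the coordinates of the text labels to center them. Instead, you can use position=position_stack(vjust=0.5). For example: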

ggplot(df.summary, aes(x=reorder(Brand, USD, sum), y=percent, fill=Category)) +
  geom_bar(stat="identity", width = .7, colour="black", lwd=0.1) +
  geom_text(aes(label=ifelse(percent >= 0.07, paste0(sprintf("%.0f", percent*100),"%"),"")),
                position=position_stack(vjust=0.5), colour="white") +
  coord_flip() +
  scale_y_continuous(labels = percent_format()) +
  labs(y="", x="")
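
If you would rather keep `position = "fill"` as in the question and let ggplot2 do the scaling, the same idea should work with `position_fill(vjust = 0.5)`. Below is a self-contained sketch on a small made-up data frame; `toy` and `share` are names invented for this example, and the data already has one row per Brand and Category.

```r
library(ggplot2)

# Small made-up data set (hypothetical values, one row per Brand/Category)
toy <- data.frame(
  Brand    = rep(c("Brand1", "Brand2"), each = 3),
  Category = rep(c("Black", "Blue", "Red"), times = 2),
  USD      = c(20, 30, 50, 5, 45, 50)
)

# Share of each Category within its Brand, used only for the label text
toy$share <- toy$USD / ave(toy$USD, toy$Brand, FUN = sum)

p <- ggplot(toy, aes(x = Brand, y = USD, fill = Category)) +
  # Bars are rescaled to sum to 1 within each Brand
  geom_col(position = position_fill(), width = 0.7) +
  # Labels use the same rescaling and sit at the middle of each segment;
  # shares below 10% get an empty label
  geom_text(aes(label = ifelse(share >= 0.10,
                               paste0(sprintf("%.0f", share * 100), "%"),
                               "")),
            position = position_fill(vjust = 0.5), colour = "white") +
  coord_flip() +
  scale_y_continuous(labels = scales::percent_format()) +
  labs(x = "", y = "")

print(p)
```

Because the bars and the labels go through the same position adjustment, the labels stay centered without a precomputed `pos` column.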


Answer 1 (score: 1)

Following this example, I found a way to place nice labels on a simple stacked bar chart. I think it may be useful too.

df <- data.frame(Category, Brand, USD)

# Calculate percentages and label positions
df.summary = df %>% group_by(Brand, Category) %>% 
  summarise(USD = sum(USD)) %>%   # Within each Brand, sum all values in each Category
  mutate( pos = cumsum(USD)-0.5*USD)

ggplot(df.summary, aes(x=reorder(Brand,USD,function(x)+sum(x)), y=USD, fill=Category)) +
  geom_bar(stat='identity',  width = .7, colour="black", lwd=0.1) +
  geom_text(aes(label=ifelse(USD>100,round(USD,0),""),
                y=pos), colour="white") +
  coord_flip()+
  labs(y="", x="")
