我在ggplot中有一个堆积百分比条形图,我想将总观察数放在堆叠条形图的顶部(同时保持堆叠条形图的百分比)。然而,我一直遇到问题。
以下是我生成百分比条形图的代码:
# sample dataset
set.seed(123)
cat1<-sample(letters[1:3], 500, replace=T, prob=c(0.1, 0.2, 0.65))
cat2<-sample(letters[4:8], 500, replace=T, prob=c(0.3, 0.4, 0.75, 0.5, 0.1))
df <- data.frame(cat1, cat2)
# the barplot
ggplot(df, aes(x=cat1))+
geom_bar(aes(fill = cat2),
position = 'fill',color = "black")+
scale_y_continuous(labels = scales::percent)+
labs ( y = "Percentage")+
# this final line is me trying to add the label
geom_text(aes(label=cat1))
# this is the observation number I want display
table(df$cat1)
# but I get this error:
Error: geom_text requires the following missing aesthetics: y
所以我有两个问题:
谢谢!
答案 0 :(得分:2)
你可以尝试
temp <- data.frame(x=c("a", "b", "c"), y=c(1.02, 1.02, 1.02), z=c(51, 101, 348))
ggplot(df, aes(x=cat1))+
geom_bar(aes(fill = cat2),
position = 'fill',color = "black")+
scale_y_continuous(labels = scales::percent)+
labs ( y = "Percentage")+
# this final line is me trying to add the label
geom_text(data=temp, aes(x=x, y=y, label=as.factor(z)))
答案 1 :(得分:2)
如果您不想对摘要标签进行硬编码,请使用dplyr
来计算百分比和格式化标签,这是一种略有不同的方法(但仍然有些不同)。
我还推翻了你的传奇以匹配图表上的顺序:)
library(dplyr)
df2 <- df %>%
group_by(cat1, cat2) %>%
summarise(n=n())%>%
mutate(percent = (n / sum(n)), cumsum = cumsum(percent), label=ifelse(cat2=="h", paste0("N=", sum(n)),""))
ggplot(df2,aes(x=cat1, y=percent, fill=cat2)) +
scale_y_continuous(labels = scales::percent) +
labs ( y = "Percentage") +
geom_bar(position = 'fill',color = "black", stat="identity") +
geom_text(aes(y=cumsum, label=label), vjust=-1) +
guides(fill=guide_legend(reverse=T))