Question

我有一个ggplot，显示了一些品牌的推文数量以及整体百分比的标签。这是通过以下链接提供的：Show % instead of counts in charts of categorical variables

# plot ggplot of brands
ggplot(data = test, aes(x = brand, fill = brand)) 
+ geom_bar() 
+ stat_bin(aes(label = sprintf("%.02f %%", ..count../sum(..count..)*100)), geom = 'text', vjust = -0.3)

Plot by Brand

接下来，我想根据品牌和情绪来绘制它，每个品牌的酒吧标签总计高达100％。但是，我很难修改我的代码来执行此操作。你能帮忙吗？此外，是否可以将neu的颜色更改为蓝色，将pos更改为绿色？

# plot ggplot of brands and sentiment
ggplot(data = test, aes(x = brand, fill = factor(sentiment))) 
+ geom_bar(position = 'dodge') 
+ stat_bin(aes(label = sprintf("%.02f %%", ..count../sum(..count..)*100)), geom = 'text', position = position_dodge(width = 0.9), vjust=-0.3)

Plot by Brand and Sentiment

这是我的数据的100行品牌和情感栏的数据

structure(list(brand = structure(c(3L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 2L, 1L, 1L, 2L, 3L, 4L, 4L, 1L, 2L, 1L, 2L, 1L, 3L, 3L, 3L, 
1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
1L, 1L, 1L, 1L, 1L, 1L, 2L, 1L, 3L, 5L, 2L, 1L, 2L, 1L, 1L, 2L, 
2L, 1L, 4L, 5L, 5L, 1L, 1L, 2L, 3L, 1L, 1L, 4L, 1L, 2L, 1L, 2L, 
1L, 1L, 1L, 1L, 2L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 2L, 1L, 
1L, 3L, 2L, 2L, 2L, 3L, 3L, 3L, 1L, 1L, 4L, 1L, 1L), .Label = c("apple", 
"samsung", "sony", "bb", "htc", "nokia", "huawei"), class = "factor"), 
    sentiment = structure(c(2L, 1L, 3L, 1L, 2L, 3L, 1L, 1L, 3L, 
    1L, 1L, 2L, 3L, 1L, 1L, 3L, 2L, 1L, 3L, 1L, 3L, 3L, 3L, 2L, 
    1L, 2L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 1L, 3L, 2L, 1L, 1L, 2L, 
    2L, 1L, 1L, 1L, 1L, 2L, 3L, 1L, 3L, 3L, 3L, 3L, 3L, 3L, 1L, 
    3L, 1L, 1L, 1L, 3L, 3L, 2L, 1L, 1L, 2L, 3L, 3L, 1L, 3L, 2L, 
    1L, 3L, 1L, 2L, 3L, 3L, 3L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 
    3L, 1L, 3L, 1L, 1L, 3L, 3L, 3L, 3L, 3L, 2L, 1L, 1L, 1L, 1L, 
    3L), .Label = c("neg", "pos", "neu"), class = "factor")), .Names = c("brand", 
"sentiment"), class = c("data.table", "data.frame"), row.names = c(NA, 
-100L), .internal.selfref = <pointer: 0x0000000003070788>)

Answer 1

发布远离ggplot2惯用法的黑客攻击，所以如果有人发布更多ggplot2方式来执行此操作，则应接受惯用方法。

所以基本上我正在创建一个虚拟数据集，其中包含您使用..count../sum(..count..)*100计算的所有信息，并使用geom_text

将其绘制在条形图的顶部

temp <- as.data.frame(table(test$brand, test$sentiment))
temp <- merge(temp, as.data.frame(table(test$brand)), by = "Var1", all.x = T)
names(temp) <- c("brand", "sentiment", "Freq", "Count")

library(ggplot2)
ggplot(data = test, aes(x = brand, fill = factor(sentiment))) + 
  geom_bar(position = 'dodge') + 
  geom_text(data = temp, aes(x = brand, y = Freq, label = sprintf("%.02f %%", Freq/Count*100)),  position = position_dodge(width = 0.9), vjust=-0.3)

enter image description here

这与您的情节不完全相同，因为您只提供了数据的子集

Answer 2

要选择您希望获得情感的颜色，请使用

scale_fill_manual（value = [并按RGB，名称等选择颜色]

您必须进行实验，但这三个因素将按字母顺序排列（除非您更改），因此您为比例选择的颜色将匹配该顺序：neg，neu，pos可能是＆＃34;灰色＆＃34 ;，＆＃34;蓝色＆＃34;，＆＃34;绿色＆＃34;

ggplot根据x轴变量添加百分比标签

2 个答案: