如何从ggplot2中的汇总数据创建堆积条形图

时间:2012-08-11 22:12:17

标签: r ggplot2 bar-chart

我正在尝试使用ggplot 2创建堆积条形图。我的数据格式很宽,看起来像这样。每个单元格中的数字是响应的频率。

activity                         yes    no  dontknow
Social events                     27    3   3
Academic skills workshops         23    5   8
Summer research                   22    7   7
Research fellowship               20    6   9
Travel grants                     18    8   7
Resume preparation                17    4   12
RAs                               14    11  8
Faculty preparation               13    8   11
Job interview skills              11    9   12
Preparation of manuscripts        10    8   14
Courses in other campuses          5    11  15
Teaching fellowships               4    14  16
TAs                                3    15  15
Access to labs in other campuses   3    11  18
Interdisciplinary research         2    11  18
Interdepartamental projects        1    12  19

我使用reshape2和

融合了这张桌子
 melted.data(wide.data,id.vars=c("activity"),measure.vars=c("yes","no","dontknow"),variable.name="haveused",value.name="responses")

这就是我能得到的。我想创建一个堆积条形图,其中包含x轴上的活动,y轴上的响应频率,以及显示yes,nos和dontknows分布的每个条形

我试过

ggplot(melted.data,aes(x=activity,y=responses))+geom_bar(aes(fill=haveused))

但我担心这不是正确的解决方案

非常感谢任何帮助。

1 个答案:

答案 0 :(得分:5)

你还没有说过你的解决方案是不对的。但是一些可以解释为问题的问题,以及每个问题的一个可能的解决方案是:

  • x轴刻度线标签相互碰撞。解决方案 - 旋转刻度线标签;
  • 标签(及其对应的条形)出现的顺序与原始数据框中的顺序不同。解决方案 - 重新排序因子'活动'的水平;
  • 要在条形图内放置文字,请将vjust中的position_stack参数设置为0.5

以下可能是一个开始。

    # Load required packages
library(ggplot2)
library(reshape2)

    # Read in data
df = read.table(text = "
activity                         yes    no  dontknow
Social.events                     27    3   3
Academic.skills.workshops         23    5   8
Summer.research                   22    7   7
Research.fellowship               20    6   9
Travel.grants                     18    8   7
Resume.preparation                17    4   12
RAs                               14    11  8
Faculty.preparation               13    8   11
Job.interview.skills              11    9   12
Preparation.of.manuscripts        10    8   14
Courses.in.other.campuses          5    11  15
Teaching.fellowships               4    14  16
TAs                                3    15  15
Access.to.labs.in.other.campuses   3    11  18
Interdisciplinay.research          2    11  18
Interdepartamental.projects        1    12  19", header = TRUE, sep = "")

    # Melt the data frame
dfm = melt(df, id.vars=c("activity"), measure.vars=c("yes","no","dontknow"),
    variable.name="haveused", value.name="responses")

    # Reorder the levels of activity
dfm$activity = factor(dfm$activity, levels = df$activity)

    # Draw the plot
ggplot(dfm, aes(x = activity, y = responses, group = haveused)) + 
geom_col(aes(fill=haveused)) +
theme(axis.text.x = element_text(angle = 90, hjust = 1, vjust = 0.25)) +
geom_text(aes(label = responses), position = position_stack(vjust = .5), size = 3)  # labels inside the bar segments