我试图用我自己的数据集在Google的Rhythm of Food复制美丽的可视化效果,显示我公司每周雇用的人数。数据集(名为hiresbyweek)看起来像这样(这是81行中的25行,link to full dataset here)
Week Year total.Hires Month WeekNum
2014-05-05 0:00:00 2014 1 May 18
2014-05-12 0:00:00 2014 1 May 19
2014-05-19 0:00:00 2014 1 May 20
2014-05-26 0:00:00 2014 1 May 21
2014-08-04 0:00:00 2014 1 August 31
2014-09-08 0:00:00 2014 1 September 36
2015-02-23 0:00:00 2015 3 February 08
2015-03-23 0:00:00 2015 4 March 12
2015-05-04 0:00:00 2015 1 May 18
2015-06-01 0:00:00 2015 1 June 22
2015-06-08 0:00:00 2015 1 June 23
2015-09-14 0:00:00 2015 3 September 37
2015-09-21 0:00:00 2015 4 September 38
2015-09-28 0:00:00 2015 15 September 39
2015-10-05 0:00:00 2015 20 October 40
2015-10-12 0:00:00 2015 47 October 41
2015-10-19 0:00:00 2015 40 October 42
2015-10-26 0:00:00 2015 39 October 43
2015-11-02 0:00:00 2015 5 November 44
2015-11-09 0:00:00 2015 2 November 45
2015-11-16 0:00:00 2015 7 November 46
2015-11-23 0:00:00 2015 1 November 47
2015-11-30 0:00:00 2015 7 November 48
2015-12-07 0:00:00 2015 3 December 49
2015-12-14 0:00:00 2015 7 December 50
目前我已经做到了这一点:
ggplot(hiresbyweek,aes( x=WeekNum, y=total.Hires,fill=as.factor(Year)))
+geom_histogram(stat="identity", aes( x=WeekNum, y=total.Hires,fill=as.factor(Year)))
+coord_polar()
+scale_fill_manual(values=c("#ACD9F4","#005DA6","#EC008C"))
+scale_x_discrete(labels = as.factor(hiresbyweek$Month))
+scale_y_discrete(expand=c(0.5,0))
+theme(text=element_text(family="Avenir")
, axis.ticks = element_blank()
, panel.grid = element_blank()
, panel.background = element_blank()
)
这产生了一些接近的东西:
基本问题是:
1)这些标签不在接近应有的位置: 请注意10月份的最大数字是多少,但根据图表,他们将主要在4月或3月。
很高兴:
1)我想按照食物图表的节奏对这些标题进行分组和旋转,这样就会有更简单的标签
2)我想大大减少所述酒吧的相对大小;我已将其作为计数(geom_historgram(stat =" count")或stat =" bin")但这使它们全部相等并消除了scale的重要性,这是关键是这里。
3)我想在条形图之间插入一些空格。我试过添加颜色="白色" a la ggplot(hiresbyweek,aes(x = WeekNum,y = total.Hires,color =" white",fill = as.factor(Year)))和geom_histogram(stat =" identity& #34;,aes(x = WeekNum,y = total.Hires,fill = as.factor(Year),color =" white"))这两个都奇怪地得到了粉红色的轮廓......
第一部分的帮助是最重要的(我觉得它很明显),但欢迎任何和所有人。感谢您的时间和想法。
答案 0 :(得分:3)
我一直在等待其他人发布一个更好,更少hackish的答案,但我希望这会在此期间做到。
# 1. We can control the order of geom_bars based on the levels of the factor of X.
# So we make a new factor variable and ensure that the levels are in the order of
# < January1, January2, ..., February2, ..., December3, December4 >
hiresbyweek <- hiresbyweek[order(hiresbyweek$WeekNum),]
hiresbyweek$X <- factor(paste0(hiresbyweek$WeekNum, hiresbyweek$Month),
levels = unique(paste0(hiresbyweek$WeekNum, hiresbyweek$Month)))
# 2. But we don't want the axis labels to be: "Jan1, Jan2, Jan3, ..."
# Instead we'll extract only the month out of the X variable (though notice the weekNum
# variable was important so we could get the right order and distinct factor levels)
# But we also don't want repeated axis labels: "Jan, "Jan", "Jan", "Feb", "Feb", ....
# So try to place the unique axis label close to the middle, and leave the rest blank
# (ie. "", "Jan", "", "", "Feb")
makeLabels <- function(x) {
x <- gsub("[0-9]", "", x)
labs <- c();
for (a in unique(x)) {
b <- rep("", length(x[x == a]))
b[ ceiling(length(x[x==a])/2) ] <- a
labs <- append(labs, b)
}
return(labs)
}
# 3. Angle the axis labels to imitate Google's Rhythm of Food
ang <- -360 / length(unique(hiresbyweek$X)) * seq_along(hiresbyweek$X)
ang[ang <= -90 & ang >= -300] <- ang[ang <= -90 & ang >= -300] -180
ggplot(hiresbyweek, aes( x = X, y = total.Hires,fill = as.factor(Year))) +
geom_histogram(stat="identity", width = 0.5) + # Use width arg for more space between bars
coord_polar() +
scale_x_discrete(labels = makeLabels) + # Apply getLabel function to X
scale_y_discrete(expand=c(0.5,0)) +
scale_fill_manual(values=c("#ACD9F4","#005DA6","#EC008C")) +
theme(axis.ticks = element_blank(),
panel.grid = element_blank(),
panel.background = element_blank(),
text = element_text(family="Avenir"),
title = element_blank(), # Remove all titles
axis.text.x = element_text(angle= ang)) # Apply angles to x-axis labels