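Here are some example data from a hypothetical meta-analysis of the effectiveness of sport-promotion interventions, for which I would like to create a forest plot: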
example.df = data.frame(Author = c("McAuliffe et al.", "Palen et al.", "Manning et al.", "Richters et al.", "Grello et al.","Mpofu et al.", "Kuo & St Lawrence", "Langstrom & Hanson", "Ompad et al.", "Abdullah et al.","Yan", "Peltzer & Pengpid", "Lo & Wei", "Haggstrom-Nordin et al.", "Mwaba & Naidoo", "Hughes et al.","Lydie et al.", "Zimmer-Gembeck et al.", "Babalola", "Garos et al.", "Pinkerton et al."),
Sport = c("Basketball", "Basketball", "Baseball", "Dance", "Baseball", "Dance", "Wrestling","Wrestling", "Dance", "Baseball", "Wrestling", "Dance", "Swimming", "Swimming","Basketball", "Basketball", "Basketball", "Basketball", "Basketball", "Swimming", "Wrestling"),
Gender = c("Male", "Female", "Male", "Male", "Female", "Male", "Male", "Male", "Male", "Female","Female", "Male", "Female", "Female", "Female", "Male", "Female", "Female", "Female", "Male", "Female"),
d = c(-0.12, 0.53, 0.11, 0.02, 0.32, 0.04, 0.03,0.04,0.26, 0.76, 1.11, 0.34, 0.77, 1.19, 0.59, 0.15, 0.30, 0.81, 0.12, 0.11, 1.01),
d_SE = c(.10, .04, .06, .01, .11, .08, .08, .04, .05, .05, .14, .07, .05, .08, .19, .16, .07, .16, .06, .18, .15))
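The data frame contains the author names, the sport, whether the sample was male or female, the effect size for the intervention, and the standard error of the effect size. I would like to create a dot plot that maps shape to gender and facets by sport. Following the examples in Chang's "cookbook" and this related query, I came up with the following code, which meets most of my formatting needs: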
library(ggplot2)
p <- ggplot(example.df, aes(x=Author, y=d, ymin=d-1.96*d_SE, ymax=d+1.96*d_SE, shape=Gender))+
geom_pointrange() +
coord_flip()+
scale_y_continuous(limits=c(-2,2),breaks=c(-2,-1.5,-1,-0.5,0,.5,1,1.5,2))+
geom_hline(yintercept=0, color="grey60",linetype="dashed")+
theme_bw()+
theme(panel.grid.major.x=element_blank(),panel.grid.minor.x=element_blank(),panel.grid.major.y=element_line(color="grey60",linetype="dashed"))+
facet_grid(Sport ~ ., scales="free_y")
p
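My problem, however, is that in the resulting plot (below) every facet shows every author in the entire data frame on the y-axis (technically the x-axis, but the axes are flipped). Instead, I only want the authors that have data for a given facet to be plotted on that facet's author axis, so each facet should have a different list of authors on its axis.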
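I had thought that the scales="free_y" argument of the facet_grid layer would ensure a unique author axis for each facet (I have also tried scales="free_x", given the flipped axes), but this does not have the intended effect.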
Does anyone know how I can ensure that the only author names appearing on each facet's axis are those that have data for that facet?
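To make the goal concrete, here is a minimal base-R sketch of the author lists each facet would be expected to show. It uses a five-row subset of the example data; toy.df and authors.by.sport are names introduced only for this illustration and are not part of the original code.

```r
# Five-row subset of the example data (illustration only)
toy.df <- data.frame(
  Author = c("McAuliffe et al.", "Palen et al.", "Manning et al.",
             "Grello et al.", "Richters et al."),
  Sport  = c("Basketball", "Basketball", "Baseball", "Baseball", "Dance"),
  stringsAsFactors = FALSE
)

# Group the author names by sport: one character vector per facet
authors.by.sport <- split(toy.df$Author, toy.df$Sport)
authors.by.sport
```

Each element of the list holds only the authors with data for that sport, which is the set of axis labels the corresponding facet should display.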
Answer 0 (score: 14)
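Andrie is right that coord_flip() seems to be the root of the problem. However, the convention for forest plots is to have the author names on the y-axis, so I wanted to find an approach that still meets that formatting requirement.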
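The accepted answer in the post that Gregor linked in his comment actually solves my problem; the only change needed was that I had to compute columns holding the upper and lower bounds of the confidence intervals.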
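As a hedged sketch of how such columns could be derived rather than typed in by hand, the bounds can be computed from d and d_SE, assuming a normal-approximation 95% interval (the same d ± 1.96 × d_SE used in the question's code). The three-row toy.df below is an illustrative subset, and the computed bounds will not exactly match hand-entered values.

```r
# Three-row subset of the example data (illustration only)
toy.df <- data.frame(
  Author = c("McAuliffe et al.", "Palen et al.", "Manning et al."),
  d      = c(-0.12, 0.53, 0.11),
  d_SE   = c(0.10, 0.04, 0.06)
)

# Critical value for a two-sided 95% interval (approximately 1.96);
# this assumes a normal approximation
z <- qnorm(0.975)

# Lower and upper confidence bounds as new columns
toy.df$ci.low  <- toy.df$d - z * toy.df$d_SE
toy.df$ci.high <- toy.df$d + z * toy.df$d_SE
toy.df
```

Computing the bounds this way keeps them consistent with d and d_SE if either column is later corrected.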
Now, using the updated data frame:
example.df = data.frame(Author = c("McAuliffe et al.", "Palen et al.", "Manning et al.", "Richters et al.", "Grello et al.","Mpofu et al.", "Kuo & St Lawrence", "Langstrom & Hanson", "Ompad et al.", "Abdullah et al.","Yan", "Peltzer & Pengpid", "Lo & Wei", "Haggstrom-Nordin et al.", "Mwaba & Naidoo", "Hughes et al.","Lydie et al.", "Zimmer-Gembeck et al.", "Babalola", "Garos et al.", "Pinkerton et al."),
Sport = c("Basketball", "Basketball", "Baseball", "Dance", "Baseball", "Dance", "Wrestling","Wrestling", "Dance", "Baseball", "Wrestling", "Dance", "Swimming", "Swimming","Basketball", "Basketball", "Basketball", "Basketball", "Basketball", "Swimming", "Wrestling"),
Gender = c("Male", "Female", "Male", "Male", "Female", "Male", "Male", "Male", "Male", "Female","Female", "Male", "Female", "Female", "Female", "Male", "Female", "Female", "Female", "Male", "Female"),
d = c(-0.12, 0.53, 0.11, 0.02, 0.32, 0.04, 0.03,0.04,0.26, 0.76, 1.11, 0.34, 0.77, 1.19, 0.59, 0.15, 0.30, 0.81, 0.12, 0.11, 1.01),
d_SE = c(.10, .04, .06, .01, .11, .08, .08, .04, .05, .05, .14, .07, .05, .08, .19, .16, .07, .16, .06, .18, .15),
ci.low = c(-.30, .45, .00, -.01, .11, -.12, -.14, -.04, .16, .66, .84, .19, .68, 1.03, .22, -.17, .17, .50, .00, -.23, .72),
ci.high = c(.07, .62, .22, .05, .53, .20, .19, .11, .36, .87, 1.38, .47, .86, 1.35, .97,.47, .43, 1.11, .24, .46, 1.30))
#reorder Author based on value of d, so effect sizes can be plotted in descending order
example.df$Author<-reorder(example.df$Author, example.df$d, FUN=mean)
...and then the plot (without any use of coord_flip()):
library(ggplot2)
p <- ggplot(example.df, aes(y = Author, x = d, xmin = ci.low, xmax = ci.high, shape=Gender)) +
geom_point() +
geom_errorbarh(height = .1) +
scale_x_continuous(limits=c(-2,2),breaks=c(-2,-1.5,-1,-0.5,0,.5,1,1.5,2))+
geom_vline(xintercept=0, color="grey60",linetype="dashed")+
facet_grid(Sport ~ ., scales = "free", space = "free") +
theme_bw() +
theme(strip.text.y = element_text(angle = 0))
p
This works very well. Thanks for all the suggestions and the help in sorting out this plot!
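For readers unfamiliar with the reorder() step used above, here is a small base-R sketch of what it does to the factor levels. The author labels in toy.df are hypothetical placeholders, not rows from the example data.

```r
# Hypothetical placeholder authors with effect sizes (illustration only)
toy.df <- data.frame(
  Author = c("A et al.", "B et al.", "C et al."),
  d      = c(0.53, -0.12, 0.11)
)

# Reorder the factor levels of Author by increasing d
toy.df$Author <- reorder(toy.df$Author, toy.df$d, FUN = mean)

# Levels now run from the smallest to the largest effect size
levels(toy.df$Author)
```

Because ggplot2 places the first level of a discrete y-axis at the bottom, levels sorted by increasing d put the largest effect at the top of the plot.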
Answer 1 (score: 4)
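It seems that coord_flip() and free scales in facets do not work well together. This is a known issue (number 95 in the ggplot2 issue log), and there are indications that the fix would be a huge rewrite that will not happen soon. Hadley says: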
Free scales are not going to work with non-Cartesian coordinate systems for a long time :/
This means that your only workaround is probably to remove coord_flip(). For example, try this:
library(ggplot2)
ggplot(example.df, aes(x=Author, y=d, ymin=d-1.96*d_SE, ymax=d+1.96*d_SE, shape=Gender, col=Gender))+
geom_pointrange() +
# coord_flip()+
scale_y_continuous(limits=c(-2,2),breaks=c(-2,-1.5,-1,-0.5,0,.5,1,1.5,2))+
theme_bw()+
theme(
panel.grid.major.x=element_blank(),
panel.grid.minor.x=element_blank(),
axis.text.x = element_text(angle=90, hjust=1)
) +
facet_grid(. ~ Sport, scales="free_x", space="free_x", shrink=TRUE, drop=TRUE)