分组boxplot r ggplot2

时间:2016-04-19 12:31:38

标签: r ggplot2 boxplot

我有5列数值数据(Equipment,Hyiene.items等)和1列分类数据(A或D)。 我想制作按类别分组的数字数据的分组箱图,但我找不到方法:

 head(sc)
  Equipment Hygiene.items Patient Near.bed Far.bed Care
1         0             0       1        5       1    D
2         1             4       1        2       0    D
3         3             1       1        2       0    D
4         0             2       2        3       1    A
5         1             2       1        5       2    A
6         1             2       1        1       1    A

boxplot(sc~sc$Care)似乎是最合适的方式吗? 我喜欢ggplot2,但它看起来不像我能做到这一点:

ggplot(sc, aes(y=sc)) + 
  geom_boxplot(aes(fill=Care))

编辑:我喜欢的外观:

我认为我之后的事情就像我在Matlab(很久以前)制作的那样:

enter image description here

或此处的第4张图:Plotly

enter image description here

到目前为止我有什么:

library(ggplot2)
library(RColorBrewer)

ggplot(melt_A,aes(x=Care,y=value,fill=Care))+geom_boxplot(ylim=c(1,6,1))+facet_grid(~variable)+
labs(x = "Care", y = "Surface contacts",color="Care" )+
  scale_y_continuous(limits = c(-0, 6))+
  scale_fill_brewer(palette="Purples")+
  theme_bw()+
  theme(strip.background=element_rect(fill="black"))+
  theme(strip.text=element_text(color="white", face="bold"))

问题

如何将护理标签从D,H,Me更改为其他标签?例如直接护理,家政,药物治疗等......

修正:

在这里找到答案:Stack

我将以下内容添加到我的ggplot命令

scale_fill_brewer(palette="Purples",
  labels = c("Direct care", "Housekeeping","Medication    round","Mealtimes","Miscellaneous care","Personal care"))

enter image description here

2 个答案:

答案 0 :(得分:12)

您的data.frame格式不正确。我将您的数据命名为“A”。你需要

library(reshape2)
melt_A<-melt(A)

现在您将“Care”变量用作ID,并在data.frame中使用适合ggplot2的值的变量

melt_A
   Care      variable value
1     D     Equipment     0
2     D     Equipment     1
3     D     Equipment     3
4     A     Equipment     0
5     A     Equipment     1
6     A     Equipment     1
7     D Hygiene.items     0
8     D Hygiene.items     4
9     D Hygiene.items     1
10    A Hygiene.items     2
11    A Hygiene.items     2
12    A Hygiene.items     2
13    D       Patient     1
14    D       Patient     1
15    D       Patient     1
16    A       Patient     2
17    A       Patient     1
18    A       Patient     1
19    D      Near.bed     5
20    D      Near.bed     2
21    D      Near.bed     2
22    A      Near.bed     3
23    A      Near.bed     5
24    A      Near.bed     1
25    D       Far.bed     1
26    D       Far.bed     0
27    D       Far.bed     0
28    A       Far.bed     1
29    A       Far.bed     2
30    A       Far.bed     1

这是您可能想要对数据进行的一个可能的情节

ggplot(melt_A,aes(x=Care,y=value,fill=Care))+geom_boxplot()+facet_wrap(~variable)

enter image description here

答案 1 :(得分:1)

您需要将所有列收集到一个列中,然后将它们映射到x,并将它们的计数映射到y。然后,您只需要将颜色映射到此列中的每个因素,并为每种护理类型手动设置alpha。

---
title: "Boxplots"
output: html_document
---

```{r setup, include=FALSE}
library(tidyverse)
library(ggplot2)


```

```{r base-data}
a <- tibble(Equipment     = sample(1:10, 50, replace = T),
            Hygiene.items = sample(1:10, 50, replace = T),
            Patient       = sample(1:10, 50, replace = T),
            Near.bed      = sample(1:10, 50, replace = T),
            Far.bed       = sample(1:10, 50, replace = T),
            Care          = sample(c("A", "D"), 50, replace = T)) %>% 
  gather(key = "Context", value = "Count", -Care)



```


```{r boxplot, echo=FALSE}

ggplot(data = a) +

  geom_boxplot(aes(x = Context,
                   y     = Count,
                   fill  = Context,
                   alpha = Care)) +

  scale_alpha_manual(values = c(0.7, 1))

```

enter image description here