Two-way contingency table with frequencies and percentages

Asked: 2019-05-11 15:35:57

Tags: r dplyr tidyverse contingency janitor

I produced the following two-way contingency table, which shows cell percentages followed by frequencies (in parentheses).

gender       blue blue-gray       brown      dark      hazel    yellow
 female 33.33% (3) 0.00% (0) 55.56%  (5) 0.00% (0) 11.11% (1) 0.00% (0)
   male 34.62% (9) 3.85% (1) 46.15% (12) 3.85% (1)  3.85% (1) 7.69% (2)

The R code I used is

library(dplyr)

library(janitor)

starwars %>%
  filter(species == "Human") %>% 
  tabyl(gender, eye_color) %>%
  adorn_percentages("row") %>%
  adorn_pct_formatting(digits = 2) %>%
  adorn_ns()

However, I would like the same kind of table with the cell frequencies first and the percentages in parentheses. Please help.

1 Answer:

Answer 0: (score: 2)

We can change the position argument of adorn_ns from "rear" (the default) to "front"

library(tidyverse)
library(janitor)  # provides tabyl() and the adorn_*() functions
starwars %>%
  filter(species == "Human") %>% 
  tabyl(gender, eye_color) %>%
  adorn_percentages("row") %>%
  adorn_pct_formatting(digits = 2) %>%
  adorn_ns(position = "front")
# gender       blue blue-gray       brown      dark      hazel    yellow
# female 3 (33.33%) 0 (0.00%)  5 (55.56%) 0 (0.00%) 1 (11.11%) 0 (0.00%)
#   male 9 (34.62%) 1 (3.85%) 12 (46.15%) 1 (3.85%) 1  (3.85%) 2 (7.69%)

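Alternatively, if the object has already been created, we can post-process it with mutate_at to reformat every column except the first. The pattern captures the percentage and the count as two groups, and the replacement swaps them by reversing the backreferences, putting the () around the percentage instead of the count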

library(tidyverse)
library(janitor)  # provides tabyl() and the adorn_*() functions
starwars %>%
  filter(species == "Human") %>% 
  tabyl(gender, eye_color) %>%
  adorn_percentages("row") %>%
  adorn_pct_formatting(digits = 2) %>%
  adorn_ns() %>% 
  mutate_at(-1, list(~ str_replace(., "^([0-9.%]+)\\s+\\((\\d+)\\)", "\\2 (\\1)")))
# gender       blue blue-gray       brown      dark      hazel    yellow
#1 female 3 (33.33%) 0 (0.00%)  5 (55.56%) 0 (0.00%) 1 (11.11%) 0 (0.00%)
#2   male 9 (34.62%) 1 (3.85%) 12 (46.15%) 1 (3.85%)  1 (3.85%) 2 (7.69%)
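
To see what the pattern does in isolation, here is a minimal sketch that applies the same regular expression to a few cell strings. It uses base R's sub() instead of str_replace, so it needs no packages; that substitution is my own choice, not part of the answer above. The cells vector is a hypothetical sample typed in from the table, not output captured from the pipeline.

```r
# Hypothetical sample cells in the default adorn_ns() layout: "percentage (count)"
cells <- c("33.33% (3)", "55.56%  (5)", "46.15% (12)")

# Group 1 captures the percentage, group 2 captures the count.
# The replacement "\\2 (\\1)" emits the count first, then the percentage in ().
swapped <- sub("^([0-9.%]+)\\s+\\((\\d+)\\)", "\\2 (\\1)", cells, perl = TRUE)

print(swapped)
# [1] "3 (33.33%)"  "5 (55.56%)"  "12 (46.15%)"
```

The swap works on the text only: extra padding inside a cell, such as the double space in "55.56%  (5)", is consumed by \\s+ and does not carry over, so any column alignment in the printed table comes from how the data frame is printed.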