我的数据当前看起来像这样,列“基于每个不同Side_Effect的Number_Code:
Session_ID Side_Effect Number_Code
1 anxious 1
1 dizzy 2
1 relaxed 3
3 dizzy 2
7 nauseous 4
7 anxious 1
我知道我可以做到:
mutate(rn = str_c('side_effect_', row_number())) %>%
pivot_wider(names_from = rn, values_from = Side_Effect)
为了创建新的列名并将每个副作用放入这样的新列中:
session Number_Code side_effect1 side effect_2 side_effect_3
1 1 anxious NA NA
1 2 NA dizzy NA
1 3 NA NA relaxed
3 2 dizzy NA NA
7 4 nauseous NA NA
7 1 NA anxious NA
但是我需要基于“ Side_Effect”和“ Number_Code”来扩大数据,并将它们放在这样的交替列中:
session side_effect1 number_code1 side effect_2 number_code2 side_effect_3 number_code3
1 anxious 1 dizzy 2 relaxed 3
3 dizzy 2 NA NA NA NA
7 nauseous 4 anxious 1 NA NA
我看到另一篇文章,他们基于两个变量扩展了数据,但是第二个列的所有列都在第一个列的所有列之后。有没有办法让他们像这样交替?谢谢!!
答案 0 :(得分:3)
pivot_wider
可以占用多个value_from
列,因此在按组创建序列之后,请使用pivot_wider
和values_from
指定感兴趣的列
library(dplyr)
library(tidyr)
df1 %>%
group_by(Session_ID) %>%
mutate(rn = row_number()) %>%
ungroup %>%
pivot_wider(names_from = rn, values_from = c(Side_Effect, Number_Code))
# A tibble: 3 x 7
# Session_ID Side_Effect_1 Side_Effect_2 Side_Effect_3 Number_Code_1 Number_Code_2 Number_Code_3
# <int> <chr> <chr> <chr> <int> <int> <int>
#1 1 anxious dizzy relaxed 1 2 3
#2 3 dizzy <NA> <NA> 2 NA NA
#3 7 nauseous anxious <NA> 4 1 NA
如果需要重新排序列顺序,则可以根据数字部分select
order
df1 %>%
group_by(Session_ID) %>%
mutate(rn = row_number()) %>%
ungroup %>%
pivot_wider(names_from = rn, values_from = c(Side_Effect, Number_Code)) %>%
select(Session_ID, names(.)[-1][order(readr::parse_number(names(.)[-1]))] )
# A tibble: 3 x 7
# Session_ID Side_Effect_1 Number_Code_1 Side_Effect_2 Number_Code_2 Side_Effect_3 Number_Code_3
# <int> <chr> <int> <chr> <int> <chr> <int>
#1 1 anxious 1 dizzy 2 relaxed 3
#2 3 dizzy 2 <NA> NA <NA> NA
#3 7 nauseous 4 anxious 1 <NA> NA
df1 <- structure(list(Session_ID = c(1L, 1L, 1L, 3L, 7L, 7L),
Side_Effect = c("anxious",
"dizzy", "relaxed", "dizzy", "nauseous", "anxious"), Number_Code = c(1L,
2L, 3L, 2L, 4L, 1L)), class = "data.frame", row.names = c(NA,
-6L))