如何整理这个数据集?

时间:2017-11-20 20:55:40

标签: r reshape tidyr

我有下面的数据集,我想整理一下。

     user_id                topic may june july august september october
1     192775                 talk   2    0    0      2         2       1
2     192775                 walk 165  123  128    146       113     105
3     192775                 bark   0    0    0      0         0       0
4     192775                 harp   0    0    0      0         0       1

我想用tidyr塑造成以下格式。

user_id      month      talk      walk      bark      harp
192775       may           2       165         0         0
192775      june           0       123         0         0

感谢任何帮助

1 个答案:

答案 0 :(得分:5)

使用:

library(tidyr)
df %>% gather(month, val, may:october) %>% spread(topic, val)

你得到:

  user_id     month bark harp talk walk
1  192775    august    0    0    2  146
2  192775      july    0    0    0  128
3  192775      june    0    0    0  123
4  192775       may    0    0    2  165
5  192775   october    0    1    1  105
6  192775 september    0    0    2  113

另一种选择是使用recast - 包中的reshape2

library(reshape2)
recast(df, user_id + variable ~ topic, id.var = c('user_id','topic'))