应用错误收集

首先：我已经检查了Using R: Make a new column that counts the number of times 'n' conditions from 'n' other columns occur，我相信这是不同的。

我有一个包含学生的大量数据集（1,304,708个观察点）。测试成绩和他们所在教室的信息，我需要知道每组有多少学生。我的数据表称为data_means，我正在尝试创建group_size。 X.2是每个学生（一个数字）的唯一标识符，而课堂是表示他们课堂的因子变量。

我需要像第三列（group_size）

这样的东西

X.2    classroom      group_size
1      09PTV0002Q     3 
2      09PTV0002Q     3
3      09PTV0002Q     3
4      09PTV0007B     2
5      09PTV0007B     2
7      15PTV0014Z     4
8      15PTV0014Z     4
9      15PTV0014Z     4
10     15PTV0014Z     4


data_means$group_size <-data_means[, Count := .N, by = list(X.2, classroom)]

生成一列，其中包含R中列中值的值

0 个答案: