Question

在这个超级简单的代码中，我希望dplyr首先通过我的分组变量（金发）然后按年龄排列，但它似乎根本不考虑分组变量。我很确定这在过去对我来说有所不同。我想要（并且期望）的是它首先由金发女郎安排（因为这是分组变量）然后是年龄。我使用的是dplyr_0.5.0。当我在分组变量上使用mutate进行测试时，它的行为符合预期，计算出一个分组均值。

我知道我可以按照金发女郎和年龄安排，但我认为dplyr的先前版本会在使用安排时考虑分组变量我是否错误记忆？

# In this code I expect it to order by the grouping
# variable first (blonde) then age.
df <- data.frame(blonde = c(0,1,0,1), 
                 age=24:21)

group_by(df, blonde) %>% arrange(age)

Source: local data frame [4 x 2]
Groups: blonde [2]

  blonde   age
   <dbl> <int>
1      1    21
2      0    22
3      1    23
4      0    24

Answer 1

您可以强制它使用该组：

df <- data.frame(blonde = c(0,1,0,1), 
                 age=24:21)

group_by(df, blonde) %>% arrange(age, .by_group = TRUE)

# A tibble: 4 x 2
# Groups:   blonde [2]
  blonde   age
   <dbl> <int>
1      0    22
2      0    24
3      1    21
4      1    23

dplyr安排不按小组安排

1 个答案: