Question

我在dplyr中使用了group_split，但在拆分了不止一列之后，我仍在努力为列表命名。

当我们将here一列分组时，我知道如何执行此操作，但是我不确定将两列拆分时如何执行此操作

我无法共享数据，但是如果使用虹膜数据集，它将与此类似（在我的情况下，两列都是因素）

iris %>%
group_split(Species, Petal.Width)

Answer 1

使用dplyr::group_keys()获取分组变量。

library(dplyr)
library(stringr)
# make grouped data frame
iris_group <- iris %>%
    group_by(Species, Petal.Width)

# get group keys
group_name_df <- group_keys(iris_group) %>%
    mutate(group_name = str_c(as.character(Species),"-",Petal.Width))

# get name for each group
gorup_name <- group_name_df$group_name

# assign name to each split table
df_list <- group_split(iris_group) %>%
    setNames(gorup_name)

> group_name_df
# A tibble: 27 x 3
   Species    Petal.Width group_name    
   <fct>            <dbl> <chr>         
 1 setosa             0.1 setosa-0.1    
 2 setosa             0.2 setosa-0.2    
 3 setosa             0.3 setosa-0.3    
 4 setosa             0.4 setosa-0.4    
 5 setosa             0.5 setosa-0.5    
 6 setosa             0.6 setosa-0.6    
 7 versicolor         1   versicolor-1  
 8 versicolor         1.1 versicolor-1.1
 9 versicolor         1.2 versicolor-1.2
10 versicolor         1.3 versicolor-1.3
# ... with 17 more rows

> df_list 
$`setosa-0.1`
# A tibble: 5 x 5
  Sepal.Length Sepal.Width Petal.Length Petal.Width Species
         <dbl>       <dbl>        <dbl>       <dbl> <fct>  
1          4.9         3.1          1.5         0.1 setosa 
2          4.8         3            1.4         0.1 setosa 
3          4.3         3            1.1         0.1 setosa 
4          5.2         4.1          1.5         0.1 setosa 
5          4.9         3.6          1.4         0.1 setosa 

$`setosa-0.2`
# A tibble: 29 x 5
   Sepal.Length Sepal.Width Petal.Length Petal.Width Species
          <dbl>       <dbl>        <dbl>       <dbl> <fct>  
.
.
.

当按多个列分组时，如何在dplyr中命名group_split的列表

1 个答案: