如何合并具有相同标识符R的行?

时间:2018-07-25 15:49:52

标签: r merge dplyr tidy

我一直在搜索很多东西,但似乎找不到我想要的答案。这些行最初融为一体,然后我将它们散布起来,现在我有了一个数据框,看起来像这样: enter image description here

这是赔率:

structure(list(ID = c(1L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L, 2L), 
    `first name` = c("Jamie", NA, NA, NA, NA, "sandra", NA, NA, 
    NA, NA), `last name` = c(NA, "Johns", NA, NA, NA, NA, NA, 
    "chan", NA, NA), q1_ans = c(NA, NA, "yes", NA, NA, NA, "yes", 
    NA, NA, NA), q2_ans = c(NA, NA, NA, "no", NA, NA, NA, NA, 
    "yes", NA), q3_ans = c(NA, NA, NA, NA, "yes", NA, NA, NA, 
    NA, "no")), row.names = c(NA, -10L), class = c("tbl_df", 
"tbl", "data.frame"), spec = structure(list(cols = list(ID = structure(list(), class = c("collector_integer", 
"collector")), `first name` = structure(list(), class = c("collector_character", 
"collector")), `last name` = structure(list(), class = c("collector_character", 
"collector")), q1_ans = structure(list(), class = c("collector_character", 
"collector")), q2_ans = structure(list(), class = c("collector_character", 
"collector")), q3_ans = structure(list(), class = c("collector_character", 
"collector"))), default = structure(list(), class = c("collector_guess", 
"collector"))), class = "col_spec"))

我拥有的实际数据框有更多的行和更多的列。我想将它们组合起来,以便ID 1的所有内容都在一行上,ID 2的所有内容都在一行上,依此类推。我已经尝试过了,但是还没到哪里

qr <- qr %>% 
  group_by(., ID) %>%
  rowwise() %>%
  summarise_all(funs(first(na.omit(.))))

我得到了错误:

Error in summarise_impl(.data, dots) : 
  Column `first name` must be length 1 (a summary value), not 0

我也尝试了dcast,但这也没有帮助。谢谢!

1 个答案:

答案 0 :(得分:0)

我们不需要rowwise。按“ ID”分组后,请在na.omit内使用summarise_all(假设每个列的“ ID”中仅包含一个非NA元素

qr %>%
    group_by(ID) %>%
    summarise_all(na.omit)
# A tibble: 2 x 6
#     ID `first name` `last name` q1_ans q2_ans q3_ans
#  <int> <chr>        <chr>       <chr>  <chr>  <chr> 
#1     1 Jamie        Johns       yes    no     yes   
#2     2 sandra       chan        yes    yes    no    

如果每个“ ID”的列中有多个非NA元素,则可以通过串联所有非NA元素来创建字符串

qr %>%
    group_by(ID) %>%
    summarise_all(funs(toString(na.omit(.))))

或创建一个list,然后执行unnest

qr %>%
   group_by(ID) %>%
   summarise_all(funs(list(na.omit(.))))