在数据框上应用group_by / do时出错,小标题/分配

时间:2020-08-07 08:35:49

标签: r group-by dplyr assign tibble

我正在通过dplyr使用以下过程在分组的数据帧上应用函数:

Input2 <- data.frame(ens=rep(as.character(1:51), each=114),
                    dates_UTC= rep(as.character(seq(as.POSIXct("2013-01-01 07:00:00"), as.POSIXct("2013-01-06 00:00:00"), by="1 hour")), 51),
                    LE = sample(c(0,0,0,0,0,0,0,0,0.005,0.004,0.003,0.002,0.001), 114*51, replace=T),
                    ETPh = rep(0, 114*51),
                    ech = rep(1:114, 51),
                    NiveauResR = rep(c(32.1, rep(NA, 113)), each=51),
                    NiveauResS = rep(c(223, rep(NA, 113)), each=51),
                    HU1=rep(c(0.028, rep(NA, 113)), each=51),
                    HU2=rep(c(0, rep(NA, 113)), each=51),
                    HU3=rep(c(0, rep(NA, 113)), each=51),
                    HU4=rep(c(0, rep(NA, 113)), each=51),
                    HU5=rep(c(0, rep(NA, 113)), each=51),
                    HU6=rep(c(0, rep(NA, 113)), each=51))

Qmm_prev <- group_by(Input2, ens) %>%
  dplyr::do(data.frame(dates_prev =.$dates_UTC, Q = test2(.))) %>%
  unnest(cols=c())

具有(从现实简化)

test2 <- function(x){
  Qmm_prev <- vector(length=nrow(x))
  for (ech in 1:nrow(x))
  {
   if (ech < 114){     
      x[ech, 8:ncol(x)] <- c(0,0,0,0,0,0)
    }
    Qmm_prev[ech] <- 10
  }
  return(Qmm_prev)
}

我遇到以下错误:

Error: Assigned data `c(0, 0, 0, 0, 0, 0)` must be compatible with row subscript `ech`.
x 1 row must be assigned.
x Assigned data has 6 rows.
i Row updates require a list value. Do you need `list()` or `as.list()`?
Run `rlang::last_error()` to see where the error occurred. 

该代码在几个月前开始工作,如果我通过“ ens”上的循环替换group_by / do代码,它也可以工作。我相信group_by / do在语法上有问题,但是我找不到它……我知道这是从以下行得出的:

 x[ech, 8:ncol(x)] <- c(0,0,0,0,0,0)

但是由于它在循环测试时可以正常工作,所以我没有发现问题以及如何解决...

有人知道吗?

谢谢

1 个答案:

答案 0 :(得分:0)

按照错误消息中的说明进行操作:

从以下位置更改行:

x[ech, 8:ncol(x)] <- c(0,0,0,0,0,0)

x[ech, 8:ncol(x)] <- as.list(0,0,0,0,0,0)

并且代码按预期工作:

Qmm_prev <- group_by(Input2, ens) %>%
  dplyr::do(data.frame(dates_prev =.$dates_UTC, Q = test2(.)))

但是,请注意,do已被取代,并且可能有更好的书写方式test2