在R中的嵌套数据帧中多次调用ifelse

时间:2014-06-09 07:37:38

标签: r nested dataframe lapply

我有一个格式为的数据框:

LociDT4Length
[[1]]
   Cohort  V1
1:    CEU 237
2:  Lupus 203
3:     RA 298
4:    YRI 278

[[2]]
   Cohort   V1
1:    CEU  625
2:  Lupus  569
3:     RA 1022
4:    YRI  762

[[3]]
   Cohort  V1
1:    CEU 161
2:  Lupus 203
3:     RA 268
4:    YRI 285

[[4]]
   Cohort   V1
1:    CEU 1631
2:  Lupus 1363
3:     RA 1705
4:    YRI 1887

几天前,我学会了这个命令:

with(LociDT4Length[[1]], ifelse(Cohort=="RA", V1/62,
                         ifelse(Cohort=="Lupus", V1/62,
                         ifelse(Cohort=="CEU", V1/96,
                         ifelse(Cohort=="YRI", V1/80,NA)))))

适当地返回结果:

[1] 2.468750 3.274194 4.806452 3.475000

但是,我尝试将此语句置于循环中会为每个嵌套DF返回一个警告,并返回不正确的结果。错误消息是:

1: In `[<-.data.table`(x, j = name, value = value) :
  Coerced 'double' RHS to 'integer' to match the column's type; may have 
  truncated precision. Either change the target column to 'double' first 
  (by creating a new 'double' vector length 4 (nrows of entire table) and  
  assign that; i.e. 'replace' column), or coerce RHS to 'integer' (e.g. 1L,  
  NA_[real|integer]_, as.*, etc) to make your intent clear and for speed.
  Or, set the column type correctly up front when you create the table and 
  stick to it, please.

所以,我想弄清楚如何诱使R使用如下语句重复应用此语句:

for (i in 1:length(LociDT4Length)){
  with(LociDT4Length[[i]], ifelse(Cohort=="RA", V1/62,
                           ifelse(Cohort=="Lupus", V1/62,
                           ifelse(Cohort=="CEU", V1/96, 
                           ifelse(Cohort=="YRI", V1/80,NA)))))
}

或者我想使用lapply将此语句应用于此嵌套数组中的46个嵌套DF。

有什么建议吗?如果ifelse语法很差且很笨重,我也愿意改变它。

非常感谢。

2 个答案:

答案 0 :(得分:6)

这应该有效:

lapply(LociDT4Length, function(x)
  with(x,ifelse(Cohort %in% c("RA","Lupus"), V1/62,
                ifelse(Cohort=="CEU", V1/96,
                       ifelse(Cohort=="YRI", V1/80,NA)))))

要避免嵌套ifelse,请尝试以下操作:

#define cohort and matching divisor
origin=c("RA","Lupus","CEU","YRI")
divisor=c(62,62,96,80)

#avoid ifelse
lapply(LociDT4Length, function(x)
  with(x,V1/divisor[match(Cohort,origin)]))

答案 1 :(得分:3)

试试这个

myFun = function(x){with(x, ifelse(Cohort=="RA", V1/62,
                         ifelse(Cohort=="Lupus", V1/62,
                         ifelse(Cohort=="CEU", V1/96,
                         ifelse(Cohort=="YRI", V1/80,NA)))))}

results = lapply(LociDT4Length, myFun)