将条件函数应用于R

时间:2017-04-06 21:09:53

标签: r function dataframe conditional lapply

我有一个数据框的数据框,如下所示:

> df
                               Var
1           word_1, word_2, word_3
2   word_1, word_2, word_3, word_4

> dput(df)
structure(list(df = list(structure(list(N = c("word_1", "word_2", "word_3")), 
.Names = "N", row.names = c(NA, -3L), class = "data.frame"), structure(list(N 
= c("word_1", "word_2", "word_3", "word_4")), 
.Names = "N", row.names = c(NA, -4L), class = "data.frame"))), .Names = "Var", 
row.names = c(NA, -2L), class = "data.frame") 

我想对数据应用一个函数,这样如果一个单词匹配一个条件,它就会被替换掉。我正在尝试这样的事情:

func_1 <- function(dataset, condition){
require(data.table)
setDT(dataset)[, lapply(.SD, function(x) ifelse(x == condition, "A", x))]
}

df <- lapply(df, func_1, condition = "word_2")

但我收到错误:

Error in matrix(unlist(value, recursive = FALSE, use.names = FALSE), nrow = 
nr,  : 
'df' must be of a vector type, was 'NULL'

我还需要一个类似func_1的函数,除了我希望能够替换单词中某处出现条件的单词。例如,func_2将包含"_"的任何单词替换为某个字符,例如B。任何指导将不胜感激!谢谢:))

1 个答案:

答案 0 :(得分:0)

以下是解决您第一个问题的dplyr解决方案:

condition <- "word_2"
library(dplyr)
mutate(df, Var = lapply(Var, mutate, N = ifelse(N == condition, "A", N)))
#                         Var
# 1         word_1, A, word_3
# 2 word_1, A, word_3, word_4

基地R的翻译:

"$<-"(df, Var, lapply(df$Var, function(x)
  "$<-"(x, N, ifelse(x$N == condition, "A", x$N))
))

由于您似乎使用data.table,我试图雕刻一些data.table等价物,但我对语法不太熟悉,因此它可能不是非常惯用的:

library(data.table)
DT <- as.data.table(df)
DT[, .(Var = list(as.data.table(Var)[, ifelse(N == condition, "A", N)])), by = seq_len(nrow(DT))]

对于您的第二个问题,它只是用N == condition替换grepl(condition, N)

mutate(df, Var = lapply(Var, mutate, N = ifelse(grepl("_", N), "B", N)))