我有一个数据框的数据框,如下所示:
> df
Var
1 word_1, word_2, word_3
2 word_1, word_2, word_3, word_4
> dput(df)
structure(list(df = list(structure(list(N = c("word_1", "word_2", "word_3")),
.Names = "N", row.names = c(NA, -3L), class = "data.frame"), structure(list(N
= c("word_1", "word_2", "word_3", "word_4")),
.Names = "N", row.names = c(NA, -4L), class = "data.frame"))), .Names = "Var",
row.names = c(NA, -2L), class = "data.frame")
我想对数据应用一个函数,这样如果一个单词匹配一个条件,它就会被替换掉。我正在尝试这样的事情:
func_1 <- function(dataset, condition){
require(data.table)
setDT(dataset)[, lapply(.SD, function(x) ifelse(x == condition, "A", x))]
}
df <- lapply(df, func_1, condition = "word_2")
但我收到错误:
Error in matrix(unlist(value, recursive = FALSE, use.names = FALSE), nrow =
nr, :
'df' must be of a vector type, was 'NULL'
我还需要一个类似func_1
的函数,除了我希望能够替换单词中某处出现条件的单词。例如,func_2
将包含"_"
的任何单词替换为某个字符,例如B
。任何指导将不胜感激!谢谢:))
答案 0 :(得分:0)
以下是解决您第一个问题的dplyr
解决方案:
condition <- "word_2"
library(dplyr)
mutate(df, Var = lapply(Var, mutate, N = ifelse(N == condition, "A", N)))
# Var
# 1 word_1, A, word_3
# 2 word_1, A, word_3, word_4
基地R的翻译:
"$<-"(df, Var, lapply(df$Var, function(x)
"$<-"(x, N, ifelse(x$N == condition, "A", x$N))
))
由于您似乎使用data.table
,我试图雕刻一些data.table
等价物,但我对语法不太熟悉,因此它可能不是非常惯用的:
library(data.table)
DT <- as.data.table(df)
DT[, .(Var = list(as.data.table(Var)[, ifelse(N == condition, "A", N)])), by = seq_len(nrow(DT))]
对于您的第二个问题,它只是用N == condition
替换grepl(condition, N)
:
mutate(df, Var = lapply(Var, mutate, N = ifelse(grepl("_", N), "B", N)))