Question

这是我的数据框DF的一瞥（）：

Observations: 221184
Variables:
$ Epsilon    (fctr) 96002.txt, 96002.txt, 96004.txt, 96004.txt, 96005.txt, 960...
$ Value   (int) 61914, 61887, 61680, 61649, 61776, 61800, 61753, 61725, 616...

我想使用dplyr过滤（删除）Epsilon的前两个级别的所有观察结果。

我的意思是：

DF %>% filter(Epsilon != "96002.txt" & Epsilon != "96004.txt")

但是，我不想使用字符串值（即＆＃34; 96002.txt＆＃34;和＆＃34; 96004.txt＆＃34;）但是级别订单（即1）和2），因为它应该是一个独立于水平值的一般指令。

Answer 1

您可以轻松地将factor转换为integer，然后在其上使用条件。只需将您的filter语句替换为：

即可

 filter(as.integer(Epsilon)>2)

更一般地说，如果您想要消除索引级别的向量，可以尝试：

 #some random levels we don't want
 nonWantedLevels<-c(5,6,9,12,13)
 #just the filter part
 filter(!as.integer(Epsilon) %in% nonWantedLevels)

使用dplyr过滤R中的因子水平

1 个答案: