R: grep special characters stored in a variable

Time: 2018-03-02 02:33:50

Tags: r regex grep special-characters

Suppose I have a character vector a, as follows:

a <- c(paste("P",1:5,"AA+1","a",8:12), paste("P",6:8,"BA-2","a",25:27), paste("P",9:16,"ZZ*3","a",25:32), paste("P",17:20,"CD/4","a",4:7), paste("P",21:24,"XY&9","a",112:113))
> a
 [1] "P 1 AA+1 a 8"    "P 2 AA+1 a 9"    "P 3 AA+1 a 10"   "P 4 AA+1 a 11"  
 [5] "P 5 AA+1 a 12"   "P 6 BA-2 a 25"   "P 7 BA-2 a 26"   "P 8 BA-2 a 27"  
 [9] "P 9 ZZ*3 a 25"   "P 10 ZZ*3 a 26"  "P 11 ZZ*3 a 27"  "P 12 ZZ*3 a 28" 
[13] "P 13 ZZ*3 a 29"  "P 14 ZZ*3 a 30"  "P 15 ZZ*3 a 31"  "P 16 ZZ*3 a 32" 
[17] "P 17 CD/4 a 4"   "P 18 CD/4 a 5"   "P 19 CD/4 a 6"   "P 20 CD/4 a 7"  
[21] "P 21 XY&9 a 112" "P 22 XY&9 a 113" "P 23 XY&9 a 112" "P 24 XY&9 a 113"

I need to grep the elements that belong to 3 of the groups, which are stored in another character vector gs:

gs <- c("AA+1","ZZ*3","XY&9")

As you can see, the problem is that these groups contain special characters.

I know I can do:

grep("AA\\+1", a, val=TRUE)

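But what about when the pattern I want to grep is stored in a vector? I can't do this, because the + is then treated as a regex quantifier and nothing matches: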

grep(gs[1], a, val=TRUE)
character(0)

1 answer:

Answer 0 (score: 1)

Just to provide some background and solve the problem:

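  • The option fixed=TRUE declares that the pattern is a string to be matched as is (literally) rather than a regular expression. It also overrides all conflicting arguments (see ?grep).

  • fixed=TRUE does not make grep behave like grep -w in bash, which only finds the pattern as a standalone word (surrounded by whitespace or word boundaries). With fixed=TRUE the pattern can still match as a substring of a longer word in the source text/character vector.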
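
As a minimal sketch of the two points above, using the a and gs from the question: the variable names hits_first, hits_by_group, in_any_group and hits_all are only illustrative, and the lapply/Reduce loop is one possible way to handle several patterns, since grep() itself takes a single pattern.

```r
# Data from the question
a <- c(paste("P", 1:5, "AA+1", "a", 8:12),
       paste("P", 6:8, "BA-2", "a", 25:27),
       paste("P", 9:16, "ZZ*3", "a", 25:32),
       paste("P", 17:20, "CD/4", "a", 4:7),
       paste("P", 21:24, "XY&9", "a", 112:113))
gs <- c("AA+1", "ZZ*3", "XY&9")

# Literal match for a single group: "+" is no longer a regex quantifier
hits_first <- grep(gs[1], a, value = TRUE, fixed = TRUE)
length(hits_first)  # 5 elements, "P 1 AA+1 a 8" through "P 5 AA+1 a 12"

# grep() accepts one pattern at a time, so loop over the groups
hits_by_group <- lapply(gs, function(g) grep(g, a, value = TRUE, fixed = TRUE))
names(hits_by_group) <- gs

# Elements that belong to any of the groups
in_any_group <- Reduce(`|`, lapply(gs, grepl, x = a, fixed = TRUE))
hits_all <- a[in_any_group]

# fixed = TRUE is still a substring match, not a whole-word match
grepl("AA+1", "xAA+1y", fixed = TRUE)  # TRUE
```

If only whole-word matches are wanted, fixed=TRUE alone is not enough; the special characters would have to be escaped and the pattern wrapped in explicit boundaries instead.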