Question

我有这个文件，我想用R

提取某些单词

 a = "people/NN + is/VB
no/AJ + one/NC + can/VA
certain/AJ + man/NN + is/VB

（每个字符串集由enter转义）

我想提取这些词（与verd＆＃34并置的词是＆＃34;）

"people" "man"

所以我尝试了这段代码，但没有工作

b = "([A-Za-z]+)/[N]\\s+\\sis/VB"
str_match_all(a, b)

请让我知道正则表达式

Answer 1

我认为下面的正则表达式可以帮到你。

r <- "[a-zA-Z]+(?=\\/NN\\s\\+\\s(is))"

我已完成与基础包的模式匹配的匹配，并且它适用于上面给出的示例。进一步测试。

m <- gregexpr(r,a,perl = TRUE)
regmatches(a,m)[[1]]

Answer 2

试试这个：

on open myItem
    set pathtobinary to [you'll need to figure out how to get the path to the binary inserted here as a function of myItem]
    do shell script "./upx.out -d " & quoted form of pathtobinary
end open

如何在R中提取具有空格和标点的某些单词

2 个答案: