Question

我目前正在使用R中的正则表达式来删除以*开头的行，例如：

* Wikipedia started from the public domain version reprinted by the [http://www.ccel.org/ Christian Classics Ethereal Library].  
* James William Richard (1898). From [[Internet Archive]].
* [http://www.melanchthon.de/e/ The Phillip Melanchthon Quinquennial]

我尝试使用函数 gsub 和regex表达式，如：

gsub("^[\\*]+[\\s\\[A-Za-z,;'\"\\s]+[.?!\\]]$","",tex1)

但什么都没发生。能帮我解决一下这个表达式的问题吗？

Answer 1

删除以$result = strip_tags($xml, "<corpo></corpo>";

开头的行

Answer 2

如果你有一个字符串v的向量，最好搜索一个向量的元素是否包含你要查找的主题，所以，你需要grepl而不是{{} 1}}。

你可以这样做：

gsub

如何使用R中的正则表达式删除以*开头的行

2 个答案: