I have a corpus made up of newspaper columns:
library(tm)
inspect(corpus8)
[[69]]
<<PlainTextDocument (metadata: 7)>>
rec: 60 col: r3 ?investigate time, place, relationships and measurement concepts in < Aboriginal > and Torres Strait Islander contexts?. Add family breakdown,
str(corpus8)
List of 71
$ 1 :List of 2
..$ content: chr "rec: 7 col: r3 by dancing at a free concert and that dysfunction is fixed by changing the constitution to "
..$ meta :List of 7
.. ..$ author : chr(0)
.. ..$ datetimestamp: POSIXlt[1:1], format: "2015-03-04 08:17:37"
.. ..$ description : chr(0)
.. ..$ heading : chr(0)
.. ..$ id : chr "1"
.. ..$ language : chr "en"
.. ..$ origin : chr(0)
.. ..- attr(*, "class")= chr "TextDocumentMeta"
..- attr(*, "class")= chr [1:2] "PlainTextDocument" "TextDocument"
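I would like advice on R code to remove this metadata before I run my analysis. The only metadata visible in the text itself takes the following form: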
rec: 6 col: r3
Any help is appreciated.
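One possible approach, given here only as a minimal sketch: strip the leading `rec: <number> col: <token>` prefix from each document with a regular expression applied through `tm_map()` and `content_transformer()`. It assumes that every document starts with a prefix of exactly that shape. The helper name `strip_prefix` and the small `demo` corpus are made up for illustration; on the real data the last step would be applied to `corpus8` instead.

```r
library(tm)

# Hypothetical helper: drop a leading "rec: <digits> col: <token>" prefix.
# Assumes the prefix sits at the very start of the document text.
strip_prefix <- function(x) {
  sub("^\\s*rec:\\s*\\d+\\s+col:\\s*\\S+\\s*", "", x, perl = TRUE)
}

# Small stand-in corpus built from two of the sample lines above
# (illustrative only; it is not the real corpus8).
demo <- VCorpus(VectorSource(c(
  "rec: 7 col: r3 by dancing at a free concert",
  "rec: 60 col: r3 ?investigate time, place"
)))

# content_transformer() wraps the function so tm_map() changes only the
# document content and leaves the TextDocumentMeta fields untouched.
cleaned <- tm_map(demo, content_transformer(strip_prefix))

content(cleaned[[1]])
```

The same call on the real data would be `corpus8 <- tm_map(corpus8, content_transformer(strip_prefix))`. If some prefixes differ from the samples shown (for example, a column label containing spaces), the pattern would need adjusting.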