删除R中的元数据

时间:2015-03-04 10:19:38

标签: r text-mining tm

我有一个由报纸专栏组成的语料库;

library(tm)
inspect(corpus8)
[[69]]
<<PlainTextDocument (metadata: 7)>>
rec: 60  col: r3 ?investigate time, place, relationships and measurement concepts in < Aboriginal > and Torres Strait Islander contexts?. Add family breakdown, 

str(corpus8)
List of 71
 $ 1 :List of 2
  ..$ content: chr "rec: 7  col: r3 by dancing at a free concert and that dysfunction is fixed by changing the constitution to "
  ..$ meta   :List of 7
  .. ..$ author       : chr(0) 
  .. ..$ datetimestamp: POSIXlt[1:1], format: "2015-03-04 08:17:37"
  .. ..$ description  : chr(0) 
  .. ..$ heading      : chr(0) 
  .. ..$ id           : chr "1"
  .. ..$ language     : chr "en"
  .. ..$ origin       : chr(0) 
  .. ..- attr(*, "class")= chr "TextDocumentMeta"
  ..- attr(*, "class")= chr [1:2] "PlainTextDocument" "TextDocument"

我希望获得有关R代码的建议,以便在进行分析之前删除此元数据。唯一可见的文本采用以下形式:

rec: 6  col: r3 

感谢任何帮助。

0 个答案:

没有答案