r - 如何按文档长度加权TermDocumentMatrix？ - Thinbug

如何按文档长度加权TermDocumentMatrix？

时间：2014-12-05 23:51:42

标签： r tm

我正在使用R包在tm中进行一些文本挖掘。我熟悉weightTfIdf作为加权选项：

dtm <- DocumentTermMatrix(docs, control = list(dictionary = c("argument"), weighting = weightTfIdf))

然而，我所处理的所有文件都包含我正在寻找的术语，但仅限于不同程度 - 因此TfIdf不起作用。如何按文件长度加权TermDocumentMatrix？

0 个答案:

没有答案