是Snowball& R的SnowballC套餐有什么不同?

时间:2014-05-07 20:58:47

标签: r stemming tm snowball

我使用stemDocument在R中使用tm包来阻止文本文档。示例代码:

data("crude")
crude[[1]]
stemDocument(crude[[1]])

我收到错误消息:

  

loadNamespace(name)出错:没有名为'Snowball'的包

我已经安装了SnowballC包,无法找到Snowball包。以下是我的sessionInfo()

R version 2.15.3 (2013-03-01)
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)

locale:
[1] en_CA.UTF-8/en_CA.UTF-8/en_CA.UTF-8/C/en_CA.UTF-8/en_CA.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] SnowballC_0.5 tm_0.5-8.3   

loaded via a namespace (and not attached):
[1] slam_0.1-31  tools_2.15.3

是否需要任何其他套餐或Snowball?

2 个答案:

答案 0 :(得分:8)

尝试将SnowballC个软件包安装到R

install.packages("SnowballC")

library(SnowballC)

它应该有用。

答案 1 :(得分:5)

您有pkg:tm的旧版本。当前版本的tm有一个DESCRIPTION文件,将SnowballC列为“建议”。旧版本建议使用Snowball。

Package: tm
Title: Text Mining Package
Version: 0.5-10
Date: 2014-01-07
Authors@R: c(person("Ingo", "Feinerer", role = c("aut", "cre"),
                    email = "feinerer@logic.at"),
             person("Kurt", "Hornik", role = "aut"),
             person("Artifex Software, Inc.", role = c("ctb", "cph"),
                    comment = "pdf_info.ps taken from GPL Ghostscript"))
Depends: R (>= 2.14.0)
Imports: parallel, slam (>= 0.1-31)
Suggests: filehash, proxy, Rcampdf, Rgraphviz, Rpoppler, SnowballC, XML

这是您目前从CRAN获得的消息:

Package ‘Snowball’ was removed from the CRAN repository.

Formerly available versions can be obtained from the archive.

Archived on 2014-03-16 at the request of the maintainer. 

您应该更新到tm的当前版本。试试这个:

update.packages("tm",  checkBuilt = TRUE)