Question

我是R.的新手。我必须在我的代码中使用POSTagger。我正在使用openNLP和R.尝试下面的示例代码（在Sample.R文件中）：

library("NLP")
library("openNLP")
s <- paste(c("Pierre Vinken, 61 years old, will join the board as a ",
"nonexecutive director Nov. 29.\n",
"Mr. Vinken is chairman of Elsevier N.V., ",
 "the Dutch publishing group."),
 collapse = "")
s <- as.String(s)

sent_token_annotator <- Maxent_Sent_Token_Annotator()
a1 <- annotate(s, sent_token_annotator)
s[a1]

从R控制台运行此代码（使用源代码（＆＃34; Sample.R＆＃34;））我收到了以下错误：

Error in as.data.frame.default(x[[i]], optional = TRUE) : 
cannot coerce class "c("Simple_POS_Tag_Annotator", "Annotator")" to a data.frame

以下是traceback（）命令的输出：

14: stop(gettextf("cannot coerce class \"%s\" to a data.frame", deparse(class(x))), 
    domain = NA)
13: as.data.frame.default(x[[i]], optional = TRUE)
12: as.data.frame(x[[i]], optional = TRUE)
11: data.frame(x = function (s, a = Annotation()) 
{
    s <- as.String(s)
    y <- f(s)
    n <- length(y)
    id <- .seq_id(next_id(a$id), n)
    type <- rep.int("sentence", n)
    if (is.Annotation(y)) {
        y$id <- id
        y$type <- type
    }
    else if (is.Span(y)) {
        y <- as.Annotation(y, id = id, type = type)
    }
    else stop("Invalid result from underlying sentence tokenizer.")
    if (length(i <- which(a$type == "paragraph"))) {
        a <- a[i]
        a$features <- lapply(annotations_in_spans(y, a), function(e) list(constituents = e$id))
        y <- c(y, a)
    }
    y
}, check.names = FALSE, stringsAsFactors = FALSE)
10: eval(expr, envir, enclos)
9: eval(as.call(c(expression(data.frame), x, check.names = !optional, 
   stringsAsFactors = stringsAsFactors)))
8: as.data.frame.list(x[[i]], optional = TRUE, stringsAsFactors = stringsAsFactors)
7: as.data.frame(x[[i]], optional = TRUE, stringsAsFactors = stringsAsFactors)
6: data.frame(position)
5: annotate(s, sent_token_annotator) at sample.R#11
4: eval(expr, envir, enclos)
3: eval(ei, envir)
2: withVisible(eval(ei, envir))
1: source("sample.R")

可能出错的是什么？我在Windows 7上使用Rx64 3.1.1。将非常感谢任何帮助。提前谢谢。

Answer 1

我有同样的问题，我通过删除/分离ggplot2包修复它。 ggplot2中有一个名为Annotate的函数，它在两个包中都是相同的名称。我建议你确保它正在查看库中的正确函数...在我的情况下，它正在查看ggplot2的Annotate函数而不是NLP包。

Answer 2

我没有确切的答案，但使用NLP，openNLP，tm，qdap遭遇了同样的错误。我向后工作重新启动R并加载（库）一个包，运行代码，然后加载另一个包并运行代码，直到我遇到“无法强制转换为数据帧”错误。我发现，在我的情况下，qdap会干扰openNLP annotate（）函数调用 - 它实际上是使用NLP包装器。

openNLP版本0.2-3导入NLP（≥0.1-2），openNLPdata（≥1.5.3-1）和rJava（≥0.6-3）。因为您明确加载了NLP，所以可能是两个NLP实例在内存中运行相互干扰的情况。尝试加载openNLP并运行代码

Answer 3

多个包具有相同的名称。如果您明确告诉R使用哪个包，它可能会解决问题。例如，请尝试使用Arrange(...)

，而不是openNLP::Arrange(...)

R：annotate（）在R中给出错误

3 个答案: