我正在尝试在R中编写ngram代码。当我使用stemDocument(corpus< -tm_map(corp_inputFile,stemDocument))时,代码工作正常。但是当我不使用stemDocument时会出错。
BigramTokenizer <- function(x) NGramTokenizer(x, Weka_control(min = 3, max = 3))
tdm <- TermDocumentMatrix(corpus, control = list(tokenize = BigramTokenizer))
forExport <- as.matrix(inspect(tdm))
write.csv(forExport, 'C:/myfile.csv')
错误讯息:
Error in .jcall("RWekaInterfaces", "[S", "tokenize", .jcast(tokenizer, :
java.lang.NullPointerException
请指导