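I wrote some code to compare the tweets of two different politicians, and I built a master corpus that contains all of them. I am using a Naive Bayes approach. I keep getting the same error message and I don't know how to fix it. The code is shown below. Any input would be greatly appreciated. Thanks!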
set.seed(300)
id_train <- sample(1:5000, 1500, replace = FALSE)
head(id_train, 10)
MasterCorpus$id_numeric <- 1:ndoc(MasterCorpus)
dfmat_training <- corpus_subset(MasterCorpus, id_numeric %in% id_train) %>%
  dfm(remove = stopwords("en"), stem = TRUE)
dfmat_test <- corpus_subset(MasterCorpus, !id_numeric %in% id_train) %>%
  dfm(remove = stopwords("english"), stem = TRUE)
tmod_nb <- textmodel_nb(dfmat_training, dfmat_training$author)
However, after running the last line of code, I keep getting this error message:
tmod_nb <- textmodel_nb(dfmat_training, dfmat_training$author)
Error in if (stats::var(as.numeric(y), na.rm = TRUE) == 0) stop("y cannot be constant") :
  missing value where TRUE/FALSE needed