Question

我有一个使用词袋为nlp构建朴素的贝叶斯分类器模型的方法。现在我要预测单个外部输入的输出。我该怎么办？请找到此github链接进行纠正，谢谢 https://github.com/Kundan8296/Machine-Learning/blob/master/NLP.ipynb

Answer 1

您需要应用与训练数据相同的预处理步骤，并将其用作分类器的输入。确保您不在新数据上使用fit_transform（），仅使用transform（）。

#Change this part in your preprocessing, so you can keep the original vectorizer.
vect = CountVectorizer(tokenizer=lambda doc: doc, lowercase=False)
bag_of_words = vect.fit_transform(corpus)
...
...
# Now when predicting, use this 
new_data = ... # your new input
new_x = vect.transform(new_data)
y_pred = classifier.predict(new_x)

如何预测单个外部输入文本在nlp（Restaurant Review）上应用的朴素贝叶斯分类器的输出？

1 个答案: