Question

我正在使用scikit学习文档分类，我想知道是否有办法预测多个classe（具有相应的概率）

以下是我正在使用的代码：

vectorizer = CountVectorizer(max_df=4000, min_df=4, strip_accents="unicode", analyzer="word", max_features=4000,stop_words=stopwords, charset="utf-8", token_pattern="\w{4,}") 
x1_vect=vectorizer.fit_transform(x1)

clf = OneVsRestClassifier(LinearSVC())
#or
#clf = MultinomialNB()

clf.fit(x1_vect,y1)
prediction = clf.predict(xpred_vect)

我只接受预测，并希望有更多（具有相应的概率）

Answer 1

事实上（我回答了我自己的问题），对于某些分类器（我使用过的linearSVC），有一种叫做

的方法

clf.decision_function

给出“样本的置信度得分，即该样本与超平面的有符号距离”

Answer 2

linearSVC有.decision_function()。其他一些分类器使用.predict_proba()

给出0到1之间的预测

用分类器进行多个预测的方法

2 个答案: