在这里,我遇到了错误 ValueError:X每个样本具有12711个特征;期待18564 任何人都知道如何解决
i use TfidVectorizer to text for train data but when i predict my model folloe=wing error occure
Here's my code, m / 4Q0OO.png
data[:0]
>>>['Meals can be served in rooms at 9:00 p. m.']
data = co_ve.fit_transform(data)
data = transf.fit_transform(data)
from sklearn.linear_model import SGDClassifier
clf = SGDClassifier(loss ='hinge', alpha= 1e-3,max_iter= 5, tol= None)
model = clf.fit(data, label)
sol = pd.read_csv('Text.csv')
s_data = []
for i ,j in zip(sol['question'], sol['answer_text']):
i = i + ' ' + j
s_data.append(i)
s_data = co_ve.fit_transform(s_data)
s_data = transf.fit_transform(s_data)
ta = clf.predict(s_data)
我遇到以下错误
/usr/local/lib/python3.6/dist-packages/sklearn/linear_model/base.py in decision_function(self, X)
268 if X.shape[1] != n_features:
269 raise ValueError("X has %d features per sample; expecting %d"
ValueError: X has 12711 features per sample; expecting 18564