ValueError:X每个样本具有12711个特征;期待18564

时间:2019-10-12 12:20:33

标签: machine-learning google-colaboratory python-textprocessing

在这里,我遇到了错误   ValueError:X每个样本具有12711个特征;期待18564 任何人都知道如何解决

i use TfidVectorizer to text for train data but when i predict my model folloe=wing error occure

Here's my code, m / 4Q0OO.png

data[:0]
>>>['Meals can be served in rooms at 9:00 p. m.']
data = co_ve.fit_transform(data)
data = transf.fit_transform(data)
from sklearn.linear_model import SGDClassifier
clf = SGDClassifier(loss ='hinge', alpha= 1e-3,max_iter= 5, tol= None)
model = clf.fit(data, label)
sol = pd.read_csv('Text.csv')
s_data = []
for i ,j in zip(sol['question'], sol['answer_text']):
    i = i + ' ' + j
    s_data.append(i)
s_data = co_ve.fit_transform(s_data)
s_data = transf.fit_transform(s_data)
ta = clf.predict(s_data)

我遇到以下错误

/usr/local/lib/python3.6/dist-packages/sklearn/linear_model/base.py in decision_function(self, X)
268         if X.shape[1] != n_features:
269             raise ValueError("X has %d features per sample; expecting %d"

ValueError: X has 12711 features per sample; expecting 18564

0 个答案:

没有答案