随机森林sklearn-等于预测值

时间:2019-02-28 10:22:22

标签: random-forest prediction

房价挑战Kaggle 我正在尝试使用RandomForestClassifier预测价格。预测后,所有ID的价格相同。您对这个问题有想法吗?

clf = RandomForestClassifier(n_estimators=50)
clf = clf.fit(X_train,y_train)
clf.score(X_train, y_train)
X = df_test2[feature_cols]    
imp = Imputer(missing_values='NaN', strategy='mean', axis=0)
df_imp = imp.fit_transform(X)
df_test_scale = scaler.transform(df_imp)
y_pred = clf.predict(df_test_scale)
predict_prices = pd.DataFrame({"Id" : df_test2['Id'], "SalePrice":y_pred})
predict_prices

2 个答案:

答案 0 :(得分:0)

既然您缩放了训练集,请使用相同的缩放器缩放测试集。

答案 1 :(得分:0)

更改如下:

clf = RandomForestClassifier(n_estimators=50)
imp = Imputer(missing_values='NaN', strategy='mean', axis=0)
X_train_imp=imp.Transform(X_train)
X_train_scaled=scaler.transform(X_train_imp)
clf = clf.fit(X_train_scaled,y_train)
clf.score(X_train, y_train)
X = df_test2[feature_cols]    
df_imp = imp.fit_transform(X)
df_test_scale = scaler.transform(df_imp)
y_pred = clf.predict(df_test_scale)
predict_prices = pd.DataFrame({"Id" : df_test2['Id'], "SalePrice":y_pred})
predict_prices
相关问题