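I built a simple model for an (old) Kaggle competition, then fit it, made predictions, and scored it. Now I want to submit.
I have cut out all the code that I don't think is needed to solve this: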
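import pandas as pd

# Load the sample submission and the key file
sample_sub = pd.read_csv('../input/web-traffic-time-series-forecasting/sample_submission_1.csv.zip')
key_1 = pd.read_csv('../input/web-traffic-time-series-forecasting/key_1.csv.zip')
# Stack the two frames (pd.concat replaces DataFrame.append, which was removed in pandas 2.0)
pages_and_id = pd.concat([sample_sub, key_1], ignore_index=True)
pages_and_id.drop('Page', axis=1, inplace=True)
pages_and_id.fillna(0)  # note: the result is not assigned, so this line has no effect
pages_and_id.drop_duplicates(inplace=True)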
submission = pd.DataFrame(index=pages_and_id['Id'], data=predictions, columns=pages_and_id['Visits']) #predictions is simply the variable made earlier, when predicting on the model
The last line raises this error:
ValueError: Shape of passed values is (36266, 182), indices imply (17407560, 17407560)
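For reference, when `pd.DataFrame` is given a 2-D array it expects `data.shape == (len(index), len(columns))`. The error above reports that the array has 36266 rows and 182 columns while both `index` and `columns` were given 17407560 labels. Below is a minimal sketch that reproduces the same kind of mismatch on toy data. The sizes, the `id0`, `id1`, ... labels, and the flattening step at the end are assumptions made purely for illustration; they are not the competition's actual Id mapping.

```python
import numpy as np
import pandas as pd

# Toy stand-ins (hypothetical sizes): 2 pages x 3 days of predictions,
# and a 5-row frame playing the role of pages_and_id.
preds = np.zeros((2, 3))
ids = pd.Series(range(5))

# Passing 5 labels for both axes while the data is 2 x 3 fails,
# because pandas checks the data shape against (len(index), len(columns)).
try:
    pd.DataFrame(data=preds, index=ids, columns=ids)
    error_message = None
except ValueError as exc:
    error_message = str(exc)

# The constructor succeeds when the labels match the data shape.
wide = pd.DataFrame(data=preds, index=["a", "b"], columns=["d1", "d2", "d3"])

# One way to get a two-column (Id, Visits) layout: flatten the array
# row by row and pair each value with one Id (made-up Ids here).
visits = preds.ravel()
submission = pd.DataFrame({
    "Id": [f"id{i}" for i in range(visits.size)],
    "Visits": visits,
})
```

In the real data, the Id for each (page, date) pair would have to come from `key_1`, and the number of predicted values would have to equal the number of Ids expected in the submission file.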
Link to the original competition on Kaggle: https://www.kaggle.com/c/web-traffic-time-series-forecasting/overview