Question

#Collecting Data from these columns
X = dataframe['Primary Type'].value_counts().loc['THEFT']
y = dataframe['Year'].value_counts()

#Dividing Data into Test and Train
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.2, random_state = 0)

#Trainng the Algorithm
regressor = LinearRegression()
regressor.fit(X_train, y_train)

#Making the Predictions
y_pred = regressor.predict(X_test)

#In the form of table
df= pd.DataFrame({'Actual': y_test, 'Predict': y_pred})
print (df)

训练模型时出现类型错误。我不明白如何解决这个问题。 X_train，X_test，y_train，y_test = train_test_split（X，y，test_size = 0.2，random_state = 0）我正在遇到问题。请帮帮我。

Answer 1

我猜测问题出在您提供的输入类型上。 values_count返回类型系列的结果。 test_train_split接受以下输入。

允许的输入是列表，numpy数组，稀疏矩阵或熊猫数据框。

要将值计数结果用作test_train_split的输入，请按以下步骤转换成列表。

jar

如果您至少要添加部分数据帧（如果它们太大而无法添加完整数据），我可以提供进一步的帮助。

TypeError：单例数组236724无法视为有效集合

1 个答案: