Question

我有以下数据集，每一行都是一辆有5个值的汽车，最后价格是

我想用5个值添加一个新的汽车行，并根据从前几行中学到的价格来计算/预测价格。

这可以在excel或python中完成吗？

Answer 1

您正在寻找的是一种解决“回归”问题的方法。在Python和Excel中，有很多方法可以做到这一点。如果您通过“ Python回归机器学习”进行Google搜索，将会发现有关如何设置数据的大量帮助。

对于Python，我会尝试使用scikit-learn模块。示例代码可能如下所示：

from sklearn import linear_model
import pandas as pd

# assume the input dataset you have above is read into a pandas dataframe:
data = pd.read_csv('inputdata.csv')

X = data[['Value1','Value2','Value3','Value4','Value5']]
y = data['Price']
regr = linear_model.LinearRegression()

# Train the model using the training sets
regr.fit(X, y)

# now assuming some new set of data with the same columns as your training data 
X_test = pd.read_csv('inputdata.csv')[['Value1','Value2','Value3','Value4','Value5']]

# can generate predictions with
predictions = regr.predict(X_test)

从上面可以看到，建立某种模型可以预测新值的代码非常少。但是，该模型可能做得不是很好。要了解如何构建强大的模型，这已经超出了此问题的范围，但是for example上有许多在线资源可以帮助您完成此任务。

预测基于其他产品的新产品的价值？

1 个答案: