SSAS Data Mining: Testing and Training Datasets... please explain

Time: 2013-04-02 20:28:29

Tags: ssas

Can someone explain what happens when the data is split into testing and training datasets?

2 answers:

Answer 0 (score: 1)

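Simply put, the accuracy of a data mining model is evaluated by training it on your training set and then making predictions for the cases in the test set, where the actual outcomes are already known.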

More information on the testing and validation of data mining models (MSDN)

Answer 1 (score: 0)

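To be able to test the predictive analytics model you have built, you need to split your dataset into two sets: a training dataset and a test dataset. These datasets should be selected randomly and should be a good representation of the actual population.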

Similar data should be used for both the training and test datasets.

Normally the training dataset is significantly larger than the test dataset.
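As a rough illustration of the random split described above, here is a minimal Python sketch using only the standard library. It is not how SSAS does it internally; the function name `split_train_test`, the 30% test fraction, and the fixed seed are all assumptions chosen for the example.

```python
import random


def split_train_test(rows, test_fraction=0.3, seed=42):
    """Randomly partition rows into (train, test) lists.

    test_fraction and seed are illustrative defaults, not SSAS settings.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(rows)                  # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)  # seeded shuffle for a repeatable split
    test_size = int(round(len(shuffled) * test_fraction))
    test = shuffled[:test_size]            # smaller held-out portion
    train = shuffled[test_size:]           # larger portion used to fit the model
    return train, test


# Hypothetical data: 100 case IDs standing in for rows of a source table
cases = list(range(100))
train, test = split_train_test(cases)
print(len(train), len(test))
```

Shuffling before slicing gives every case the same chance of landing in either set, and no case ends up in both.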

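Evaluating the model on a separate test dataset helps you detect problems such as overfitting, where the model fits the training data closely but predicts new cases poorly.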

The trained model is run against test data to see how well the model will perform.
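To make the overfitting point concrete, here is a small Python sketch under stated assumptions: the "model" is a deliberately naive lookup table that memorizes its training cases, and the age/purchase data is made up for illustration. The helper names `accuracy` and `train_memorizer` are hypothetical and have nothing to do with SSAS.

```python
def accuracy(predict, cases):
    """Fraction of (features, label) cases where predict(features) == label."""
    correct = sum(1 for features, label in cases if predict(features) == label)
    return correct / len(cases)


def train_memorizer(train_cases, default_label):
    """A deliberately overfit 'model': it memorizes every training case
    and falls back to default_label for anything it has not seen."""
    table = {features: label for features, label in train_cases}
    return lambda features: table.get(features, default_label)


# Made-up toy data: the feature is an age, the label is whether a purchase happened
train_cases = [(25, "no"), (32, "yes"), (41, "yes"), (19, "no"), (55, "yes"), (23, "no")]
test_cases = [(30, "yes"), (22, "no"), (47, "yes"), (35, "yes")]

model = train_memorizer(train_cases, default_label="no")
train_acc = accuracy(model, train_cases)  # scored on cases the model has seen
test_acc = accuracy(model, test_cases)    # scored on held-out cases
print(train_acc, test_acc)
```

The memorizer looks perfect when scored on its own training cases, yet does much worse on the held-out cases. That gap only becomes visible because the test set was kept apart from training.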
