我使用决策树(随机森林和随机树)对小数据集65x9进行了分类。我有四个类,8个属性和65个实例。
我的应用程序是辅助机器人技术。因此,我从传感器数据中提取一些参数,我认为这些参数与用户在执行某项任务时进行分类相关。我从部署在轮椅上的传感器包中获取运动数据。我将某些动作分类为转动180度,并给我一个标记(从1到4)所以从传感器包和软件我提取的参数如速度,距离,时间,速度的标准偏差等相关用于运行的用户的分类。所以我的数据都是数字。
当我执行决策树分类时,我得到了这个结果
=== Classifier model (full training set) ===
Random forest of 10 trees, each constructed while considering 4 random features.
Out of bag error: 0.5231
Time taken to build model: 0.01 seconds
=== Evaluation on training set ===
=== Summary ===
Correctly Classified Instances 64 98.4615 %
Incorrectly Classified Instances 1 1.5385 %
Kappa statistic 0.9791
Mean absolute error 0.0715
Root mean squared error 0.1243
Relative absolute error 19.4396 %
Root relative squared error 29.0038 %
Total Number of Instances 65
=== Detailed Accuracy By Class ===
TP Rate FP Rate Precision Recall F-Measure ROC Area Class
1 0 1 1 1 1 c1
1 0 1 1 1 1 c2
0.952 0 1 0.952 0.976 1 c3
1 0.019 0.917 1 0.957 1 c4
Weighted Avg. 0.985 0.003 0.986 0.985 0.985 1
=== Confusion Matrix ===
a b c d <-- classified as
14 0 0 0 | a = c1
0 19 0 0 | b = c2
0 0 20 1 | c = c3
0 0 0 11 | d = c4
这太好了。我做错了吗?