Question

所以我对这个函数的理解是将表分成两部分，然后比较这些值以确定预测率

让我们说我有一张桌子：

Column1   Column2   Column3   Column4   Column5 
3          2          2          43         0
1          2          2          23         1
5          5          2          56         1
4          3          2          13         0
6          1          2          11         1

"Column 5" is label 0 or 1

我知道前3行是100％正确的，因为我手动为其分配了标签，但是第4行和第5行是使用随机森林分类器进行标签的。我想看看预测率是什么

我想使用classification_report（y_true，y_pred，target_names = target_names），我的“ y_true”，“ y_pred”将是什么？我假设target_names = 0,1

Answer 1

y_true是样本的真实标签，y_pred是在模型中进行的预测。 target_name可让您为类标签分配自定义名称

classification_report报告模型的精度，召回率，f1得分和支持。

示例如sklearn中所示 https://scikit-learn.org/stable/modules/generated/sklearn.metrics.classification_report.html

from sklearn.metrics import classification_report
y_true = [0, 1, 2, 2, 2]
y_pred = [0, 0, 2, 2, 1]
print(classification_report(y_true, y_pred, target_names=target_names))

例如，类别0的精度为“正正” /“正正+误正”，其精度为1/2 = 0.5

在Sklearn中使用Classification_Report函数

1 个答案: