Question

在运行GridSearchCV时，我试图了解cv_validation_scores的输出。文档没有充分解释这一点。

当我打印grid_search.grid_scores_时，我会得到一个包含项目的列表，如下所示：

[mean: 0.60000, std: 0.18002, params: {'tfidf__binary': True, tfidf__ngram_range': (1, 1)....

这是有道理的。但是，当我尝试解压每个grid_scores实例时，我得到：

[0] same dictionary as above, makes sense
[1] score for all folds, makes sense
[2] a list that I don't understand, that looks like, "[ 0.75        0.33333333  0.66666667]"

这里报道的分数是多少？

Answer 1

正如我在邮件列表上发布的那样，the documentation非常清楚：

grid_scores_：命名元组列表

包含param_grid中所有参数组合的分数。每个条目对应一个参数设置。每个命名元组都具有以下属性：
        parameters, a dict of parameter settings
        mean_validation_score, the mean score over the cross-validation folds
        cv_validation_scores, the list of scores for each fold

这些是交叉验证中的每次折叠分数。

Answer 2

我取消订阅并重新订阅。似乎现在工作

了解SciKit了解CV验证分数

2 个答案: