应用错误收集

我想可视化每个折痕的分布，以检查所有折痕是否具有大致代表性的分布。

到目前为止，我正在使用现有教程中的代码，但不确定如何在每个集合中查找特征并绘制分布，而无需遍历每个折叠的结果。

rf = RandomForestRegressor(n_estimators = 100, random_state = 42)
from sklearn.model_selection import StratifiedKFold
cv = StratifiedKFold(n_splits=3, random_state=42, shuffle=True)
for train_index, test_index in cv.split(features, labels):
    print("Train Index: ", train_index, "\n")
    print("Test Index: ", test_index)
    X_train, X_test, y_train, y_test = features[train_index],
features[test_index], labels[train_index], labels[test_index]         
    rf.fit(X_train, y_train)

对于每折我想输出每折每个特征的直方图（例如5个特征=每折5个直方图）。

有关如何执行此操作的任何信息都将有所帮助。

如何可视化每个StratifiedKFold中的特征分布

0 个答案: