应用错误收集

我对sklearn的LabelEncoder有疑问。

df =
    AIRLINE WAKE    WEEKDAY
    AEA     H       6   
    AEA     H       6       
    AEA     H       6   
    AEA     H       6   
    AEA     H       4

这是我创建编码器的方式：

categorical_features = ["AIRLINE","WAKE"]
enc = preprocessing.LabelEncoder()
X_categorical = df[categorical_features].apply(enc.fit)
joblib.dump(enc, "encoders/labelencoder.pkl")

然后我将其加载并应用如下：

label_encoder = joblib.load('encoders/labelencoder.pkl')
label_encoder.handle_unknown = 'ignore'
categorical_features = ['AIRLINE','WAKE','WEEKDAY']
test = df[categorical_features].apply(label_encoder.transform)

发生错误：

ValueError ：（“ y包含以前看不见的标签：'AEA'“，' 在索引AIRLINE'）

看起来labelencoder.pkl包含最后一列的编码器。如何保存所有编码器然后使用它们？

LabelEncoder：ValueError-y包含以前看不见的标签

0 个答案: