Question

我正在尝试在创建机器学习模型之前使用LabelEncoder（）对数据帧进行编码

此处是代码：

from sklearn.preprocessing import LabelEncoder
# LabelEncoder
le = LabelEncoder()

# apply "le.fit_transform"
df_encoded = data1.apply(le.fit_transform)
print(df_encoded)
print(le.classes_)

但是我遇到了这个错误：

TypeError: ("'<' not supported between instances of 'str' and 'NoneType'", 'occurred at index SACC_MARKET_SEGMENT')

任何人都可以帮助ùme解决此问题？谢谢你

Answer 1

数据类型可能有问题。我不知道您想要的数据类型是什么，但是您可以尝试将data1转换为字符串：

from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()

df_encoded = le.fit_transform(data1.astype(str))
print(df_encoded)
print(le.classes_)

使用编码器使用python编码数据框

1 个答案: