I'm trying to bin a column into custom categories using a list of bin edges, as suggested in this answer:
import pandas
from pandas import DataFrame
bins = [0, 1, 5, 10, 25, 50, 100]
df = DataFrame({'Numbers': [0, 1, 2, 7, 11, 16, 45, 200]})
df['Bins'] = pandas.cut(df['Numbers'], bins)
df
   Numbers      Bins
0        0       NaN
1        1    (0, 1]
2        2    (1, 5]
3        7   (5, 10]
4       11  (10, 25]
5       16  (10, 25]
6       45  (25, 50]
7      200       NaN
How can I bin these so that 0 falls into [0, 1] and 200 falls into (100, ...) or a "> 100" category?
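Answer 0 (score: 4)
You should add -np.inf and np.inf as the outermost bin edges, so that values at or below the lowest edge and values above 100 still land in a bin: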
import numpy as np
import pandas as pd
bins = [-np.inf, 1, 5, 10, 25, 50, 100, np.inf]
df['Bins'] = pd.cut(df['Numbers'], bins, include_lowest=True)
df
Out[580]:
   Numbers          Bins
0        0   (-inf, 1.0]
1        1   (-inf, 1.0]
2        2    (1.0, 5.0]
3        7   (5.0, 10.0]
4       11  (10.0, 25.0]
5       16  (10.0, 25.0]
6       45  (25.0, 50.0]
7      200  (100.0, inf]
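
If named categories such as "> 100" are preferred over interval notation, one option is to pass `labels` to `pd.cut`. The following is a minimal sketch, assuming the same `Numbers` column as in the question; the label strings are made up for illustration and are not part of the answer above. Here the lowest edge stays at 0, `include_lowest=True` makes the first bin include 0, and `np.inf` as the last edge catches everything above 100.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({'Numbers': [0, 1, 2, 7, 11, 16, 45, 200]})

# Keep 0 as the lowest edge; np.inf as the last edge catches values above 100.
bins = [0, 1, 5, 10, 25, 50, 100, np.inf]

# Hypothetical label names, one per bin (always one fewer than the number of edges).
labels = ['0-1', '1-5', '5-10', '10-25', '25-50', '50-100', '>100']

# include_lowest=True makes the first bin closed on the left, so 0 is included.
df['Bins'] = pd.cut(df['Numbers'], bins=bins, labels=labels, include_lowest=True)
print(df)
```

With this variant, 0 and 1 both end up in the first category and 200 ends up in the last one, so no row is left as NaN. Negative values would still be NaN, since nothing below 0 is covered by these edges.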