Question

我试图做多个分组，并且还在新列中增加计数。

我的输入文件

OrderDate   Region  Rep Item    Units   Unit Cost   Total
----------------------------------------------------------
1/6/18      East    Jones   Pencil  95   1.99    189.05 
1/23/18     Central Kivell  Binder  50   19.99   999.50 
2/9/18      Central Jardine Pencil  36   4.99    179.64 
2/26/18     Central Gill    Pen 27   19.99   539.73 
3/15/18     West    Sorvino Pencil  56   2.99    167.44 
4/1/18      East    Jones   Binder  60   4.99    299.40 
4/18/18     Central Andrews Pencil  75   1.99    149.25 
4/18/18     West    Jones   Pencil  75   1.99    149.25

我想做喜欢的

Region Rep Count same/diff
-------------------------------
east jones  2    2-same
     jones
central Kivell  4 >3 differnce
        Jardine
        Gill    
        Andrews 
West  Sorvino 2   2-different
West  jones1

我的代码：

df1 = pd.read_excel(excel_path, sheet_name = 'SalesOrders', index_col=0)
df3 = (df1.groupby('Region')['Rep'].value_counts())

print(df3)

请帮助我做到这一点。谢谢在rep列中，基于Region我已完成分组依据以了解Rep值。如果Rep成员相同，则2个相同的人，则认为中心区域有4个不同的人在工作，所以我大于3个。

熊猫很难添加新列？

0 个答案: