#我的输入数据框(df)如下所示:
+----------+--------------------------+
| order_id | product_category_name |
+----------+--------------------------+
| 47585339 | Beverages |
| 47585339 | Snacks & Branded Foods |
| 47585339 | Foodgrains, Oil & Masala |
| 31076491 | Baby Care |
| 78258104 | Breakfast Cereals |
| 78258104 | Snacks & Namkeen |
| 78258104 | Pickles & Chutney |
+----------+--------------------------+
所需的输出数据框:
| order_id | 0 | 1 | 2
| | | |
| 47585339 | Beverages |Snacks & Branded Foods |Foodgrains, Oil & Masala
| 31076491 | Baby Care |nan | nan
| 78258104 | Breakfast Cereals |Snacks & Namkeen |Pickles & Chutney
#我通过分组依据进行了尝试:
groups = df.groupby('order_id')
df_result= pd.DataFrame(groups['product_category_name'])
#但是输出不是以期望的方式来的。除了使用分组方式之外,任何方法都是可以接受的。