实施例

#!/usr/bin/env python

"""Test data frame grouping."""

# 3rd party modules
import pandas as pd


df = pd.DataFrame([{'id': 1, 'price': 123, 'name': 'anna', 'amount': 1},
                   {'id': 1, 'price':   7, 'name': 'anna', 'amount': 2},
                   {'id': 2, 'price':  42, 'name': 'bob', 'amount': 30},
                   {'id': 3, 'price':   1, 'name': 'charlie', 'amount': 10},
                   {'id': 3, 'price':   2, 'name': 'david', 'amount': 100}])
print(df)

给出数据框：

   amount  id     name  price
0       1   1     anna    123
1       2   1     anna      7
2      30   2      bob     42
3      10   3  charlie      1
4     100   3    david      2

我想得到：

amount  id     name  price
     3   1     anna    130
    30   2      bob     42
   110   3  charlie      3

所以：

id列中具有相同值的条目属于一起。在该操作之后，应该仍然存在id列，但它应该只有唯一值。
amount和price中具有相同id总结的所有值
对于name，仅采用第一个（按数据帧的当前顺序）。

Pandas可以吗？

Answer 1

您正在寻找

aggregation_functions = {'price': 'sum', 'amount': 'sum', 'name': 'first'}
df_new = df.groupby(df['id']).aggregate(aggregation_functions)

给出了

    price     name  amount
id                        
1     130     anna       3
2      42      bob      30
3       3  charlie     110

Answer 2

对于相同的列，必须添加reindex，因为dict汇总了

d = {'price': 'sum', 'name': 'first', 'amount': 'sum'}
df_new = df.groupby('id', as_index=False).aggregate(d).reindex(columns=df.columns)
print (df_new)
   amount  id     name  price
0       3   1     anna    130
1      30   2      bob     42
2     110   3  charlie      3

如何使用聚合函数将Pandas中的列中的行“合并”相同的值？

实施例

2 个答案: