Python熊猫groupby与lambda

时间:2019-11-24 03:06:18

标签: python pandas-groupby

我的数据框:

df = pd.DataFrame({'company': ['A', 'A', 'A', 'A', 'A','B','B','B','B','B'],
                      'offered': [1, 1, 0, 1, 1, 1, 0, 0, 1, 1],
                      'accepted': [0, 1, 0, 1, 1, 0, 0, 0, 1, 0]})

    company offered accepted
0   A       1       0
1   A       1       1
2   A       0       0
3   A       1       1
4   A       1       1
5   B       1       0
6   B       0       0
7   B       0       0
8   B       1       1
9   B       1       0

我希望我的最终结果如下:

df2 = df.groupby('company')[['offered', 'accepted']].agg('sum')
df2['accept_rate'] = df2['accepted']/df2['offered']
df2

    offered accepted    accept_rate
company         
A   4       3           0.750000
B   3       1           0.333333

但是,我想一次完成此操作(例如,使用Lambda)。这是我尝试过的

df['accept_rate'] = df['accepted'] / df['offered']
df.groupby('company')[['offered', 'accepted', 'accept_rate']].agg({'offered': 'sum',
                                                    'accepted': 'sum',
                                                    'accept_rate': lambda x: df['accepted'].sum()/df['offered'].sum()})

    offered accepted    accept_rate
company         
A   4       3           0.571429
B   3       1           0.571429

如您所见,accept_rate = 0.571429是针对全部/合并的公司。

如何使accept_rate看起来像我想要的最终结果?

谢谢!

1 个答案:

答案 0 :(得分:2)

您想要assign

df.groupby('company').agg({'offered': 'sum', 'accepted': 'sum'}).assign(accept_rate=lambda x: x['accepted']/x['offered'])