如果两个单元格相同但顺序不同,则求和两行

时间:2018-08-16 10:33:00

标签: python-3.x pandas

类似于下面

Buyer Seller Amount
John  Mary   3
Mary  John   2
David Bosco  2

我想将John和Mary的行合计为一个

预期出来

Trade1 Trade2 Amount
John   Mary   5
David  Bosco  2

我的数据框大约有6000行。谢谢您的帮助

1 个答案:

答案 0 :(得分:1)

首先通过numpy.sort对值进行排序,然后通过DataFrame.duplicated创建布尔掩码,然后聚合sum

df[['Buyer','Seller']] = pd.DataFrame(np.sort(df[['Buyer','Seller']], axis=1))

df2 = df.groupby(['Buyer','Seller'], as_index=False)['Amount'].sum()
df2.columns = ['Trade1','Trade2','Amount']
print (df2)
  Trade1 Trade2  Amount
0  Bosco  David       2
1   John   Mary       5

如果不想修改原始列,请使用语法糖-groupbySeries

df1 = pd.DataFrame(np.sort(df[['Buyer','Seller']], axis=1))
df1.columns = ['Trade1','Trade2']

df2 = df['Amount'].groupby([df1['Trade1'],df1['Trade2']]).sum().reset_index()
print (df2)
  Trade1 Trade2  Amount
0  Bosco  David       2
1   John   Mary       5