I have two DataFrames. The first (df1) looks like this:
      JID  JR1SubUsageLabel_16  SUB_USAGE_16
22   6223         JR_BOne_CY16           NaN
26   6510            JR_S_CY16           NaN
59  11932            JR_B_CY16           NaN
70  14242            JR_B_CY16           NaN
The second DataFrame (df2) looks like this:
      JID  JR1_B_CY16  JR_CY16
 1   1457         NaN      NaN
 2   1530         NaN      NaN
 3   1535           5      NaN
 4   2035         NaN      NaN
 5   6223           5      NaN
 6   6510         1.0        6
39  11932         1.0      NaN
40  12021         NaN      NaN
41  12056         NaN      NaN
42  14234           2      1.0
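I want to update df1 based on the JID column and the values in the JR1SubUsageLabel_16 column. JID is the matching column between the two DataFrames. The values in df1's JR1SubUsageLabel_16 column correspond to column names in df2, so for each row of df1, one of df2's columns matches that row's JR1SubUsageLabel_16 value. The expected result is shown below.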
      JID  JR1SubUsageLabel_16  SUB_USAGE_16
22   6223           JR1_B_CY16             5
26   6510           JR1_S_CY16             6
59  11932           JR1_B_CY16             1
70  14242           JR1_B_CY16             2
I have been trying to do this with a lambda and map, but I cannot work out exactly how to do the update. Can anyone help me?
Thanks in advance.
Answer 0 (score: 3)
One way is to melt df2 into long format and then merge:
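# Reshape df2 from wide to long: one row per (JID, label) pair
s = df2.melt('JID', value_name='SUB_USAGE_16', var_name='JR1SubUsageLabel_16')
# Drop the empty column from df1, then look up each value with a left join on both keys
df1.drop('SUB_USAGE_16', axis=1).merge(s, on=['JID', 'JR1SubUsageLabel_16'], how='left')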
Output:
     JID  JR1SubUsageLabel_16  SUB_USAGE_16
0   6223      JR1_BioOne_CY16           5.0
1   6510    JR1_Springer_CY16           6.0
2  11932      JR1_BioOne_CY16           1.0
3  14242      JR1_BioOne_CY16           NaN
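For anyone who wants to try this end to end, here is a minimal self-contained sketch of the same melt-then-merge idea. The sample frames are rebuilt by hand and are assumptions: the label values in df1 are taken to match df2's column names exactly, using the names that appear in the output above. The `validate` argument and the index restore at the end are optional extras, not part of the answer as posted.

```python
import numpy as np
import pandas as pd

nan = np.nan

# Assumed sample data: df1's labels are taken to equal df2's column names exactly.
df1 = pd.DataFrame(
    {
        "JID": [6223, 6510, 11932, 14242],
        "JR1SubUsageLabel_16": [
            "JR1_BioOne_CY16",
            "JR1_Springer_CY16",
            "JR1_BioOne_CY16",
            "JR1_BioOne_CY16",
        ],
        "SUB_USAGE_16": [nan, nan, nan, nan],
    },
    index=[22, 26, 59, 70],
)

df2 = pd.DataFrame(
    {
        "JID": [1457, 1530, 1535, 2035, 6223, 6510, 11932, 12021, 12056, 14234],
        "JR1_BioOne_CY16": [nan, nan, 5, nan, 5, 1, 1, nan, nan, 2],
        "JR1_Springer_CY16": [nan, nan, nan, nan, nan, 6, nan, nan, nan, 1],
    }
)

# Wide to long: every (JID, column name) pair in df2 becomes its own row.
long_df2 = df2.melt(
    id_vars="JID",
    var_name="JR1SubUsageLabel_16",
    value_name="SUB_USAGE_16",
)

# Left join on both keys so every df1 row is kept; rows with no match get NaN.
# validate="many_to_one" raises if a (JID, label) pair is duplicated in long_df2.
result = df1.drop(columns="SUB_USAGE_16").merge(
    long_df2,
    on=["JID", "JR1SubUsageLabel_16"],
    how="left",
    validate="many_to_one",
)

# merge returns a fresh 0..n-1 index; a left join keeps df1's row order,
# so the original index can be put back directly.
result.index = df1.index

print(result)
```

JID 14242 does not occur in df2, so its SUB_USAGE_16 stays NaN, which agrees with the last row of the output above.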