Question

我想要一种快速有效的方法，根据另一个数据帧中的值在['b'，'c']中为每行的'col_1'和'col_2'列中的值设置一个'id' 。以下是我尝试使用df.update进行此操作的简单示例。

data = {'id': ['a', 'b', 'b', 'c'],
        'col_0': ['e','f','g','h'],
        'col_1': ['m','n','o','p'],
        'col_2': ['q','r','s','t']}
df=pd.DataFrame.from_dict(data)
df

#the data frame dictating the changes to be made
cols=['col_1','col_2']
chg_dict={'b': ['b_0','b_1'],'c': ['c_0','c_1']}
chg_df=pd.DataFrame.from_dict(chg_dict,orient='index',columns=cols)
chg_df

#make the change
for chg in chg_df.index:
    #mask to get index where id is in chg_dict
    mask=[r for r in df.index if df.loc[r,'id']==chg]
    #this is apparently where I go wrong, nothing changes
    df.loc[mask,cols].update(chg_df)
df

我尝试使用和不使用cols索引器。

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.update.html

Answer 1

据我了解，您可以尝试做：

m=df.set_index('id')
m.update(chg_df)
df=m.reset_index()
print(df)

  id col_0 col_1 col_2
0  a     e     m     q
1  b     f   b_0   b_1
2  b     g   b_0   b_1
3  c     h   c_0   c_1

如何使用具有相同列名的向量设置某些行值？

1 个答案: