I have a data frame like this:
id text
1 DM,HTN,Enlarged prostate
2 hypertensive and on regular treatment
2 LBP
3 DM,HTN,Enlarged prostate
I want to combine the text of rows that share the same id, like this:
id text
1 DM,HTN,Enlarged prostate
2 hypertensive and on regular treatment LBP
3 DM,HTN,Enlarged prostate
The text for id 2 has been combined. How can I do this? Any help would be greatly appreciated.
Answer 0 (score: 3)
You can do this with the groupby function:
grouped_df = df.groupby("id").apply(lambda x: ' '.join(x['text']))
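To try the grouping end to end, here is a minimal sketch. The construction of df is an assumption, since the question only shows the printed frame. The sketch also selects the text column before calling apply, a small variation on the line above that keeps the grouping column out of the applied function. The result is a Series indexed by id rather than a data frame.

```python
import pandas as pd

# Sample data rebuilt from the question (the constructor call is assumed).
df = pd.DataFrame({
    "id": [1, 2, 2, 3],
    "text": [
        "DM,HTN,Enlarged prostate",
        "hypertensive and on regular treatment",
        "LBP",
        "DM,HTN,Enlarged prostate",
    ],
})

# Join all text values within each id, separated by a single space.
combined = df.groupby("id")["text"].apply(" ".join)

# The result is a Series whose index is the id, so lookups are by id.
print(combined.loc[2])
```

If a regular data frame with id as a column is needed, combined.reset_index() converts the index back into a column.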
Answer 1 (score: 2)
You can use groupby together with apply and the join function, then finish with reset_index:
grouped_df = df.groupby("id")['text'].apply(' '.join).reset_index()
print(grouped_df)
   id                                       text
0   1                   DM,HTN,Enlarged prostate
1   2  hypertensive and on regular treatment LBP
2   3                   DM,HTN,Enlarged prostate
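As a possible alternative, the reset_index step can be folded into the groupby call by passing as_index=False and using agg. This is a sketch rather than part of the original answer, and the sample frame is again rebuilt by hand from the question.

```python
import pandas as pd

# Sample data rebuilt from the question (the constructor call is assumed).
df = pd.DataFrame({
    "id": [1, 2, 2, 3],
    "text": [
        "DM,HTN,Enlarged prostate",
        "hypertensive and on regular treatment",
        "LBP",
        "DM,HTN,Enlarged prostate",
    ],
})

# as_index=False keeps id as an ordinary column, so no reset_index is needed.
grouped_df = df.groupby("id", as_index=False)["text"].agg(" ".join)

print(grouped_df)
```

Both forms give one row per id. Which one to use is mostly a matter of taste.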