列表中的短语相似度

时间:2018-09-17 14:32:08

标签: python list dataframe similarity difflib

您好,假设我有2个列表:

names = ['Daniel', 'Mario', 'Mandy', 'Jolene', 'Fabio']
places = ['on top of the table', 'France', 'valley of the kings']

和带有一些句子的数据框 例如:

DataframeOrig

Index | Sent
0     | Mandy went to France on the Eiffel Tower
1     | Daniele was dancing on top of the box
2     | I am eating on top of the table
3     | Maria went to the valley of the kings

我想使用difflib之类的距离度量来扫描句子并将短语与具有确定偏移量的列表进行比较。希望这样做的结果将是:

Index | Sent                                     | Result
0     | Mandy went to France on the Eiffel Tower | Mandy
1     | Daniele was dancing on top of the box    | Daniel
2     | I am eating on top of the table          | on top of the table
3     | Maria went to the valley of the kings    | Mario, valley of the kings

在不使用循环加载获取词组匹配的情况下如何解决?

0 个答案:

没有答案