模糊匹配并从字符串向量中提取字符串以完成数据帧

时间:2019-02-10 10:00:29

标签: r tidyverse fuzzy-search fuzzyjoin

我列出了一些法语名称,在语法上有一些细微的差别。

names <- c("Benoit", "Arnoud (son)", "Arnoud", "Arnous", "Archer, Patrice*", "Archer", "Archer (father)", "André" )

“ Arnoud(son)”,“ Arnoud”,“ Arnous”所有这些名称都属于同一家族。我希望能够创建一个数据框对象,以便按家庭对个人进行分组

people1           |people2 |people3  |people4|
"Benoit"          | NA     |NA       |NA
"Arnoud (son)",   |"Arnoud"|"Arnous" | NA
"Archer, Patrice*"|"Archer"| "Archer"|"Archer (father)"
"André"           | NA     | NA      |NA

0 个答案:

没有答案