数据框中两列之间的字符串匹配

时间:2015-12-09 06:18:01

标签: r string-matching

我有一个包含“地址”,“区域”和“州”的数据帧(df)为三列:

输入:

Address                                  District       State
132, 1st block, Mysore,Karnataka         Mysore        Karnataka
24, 4th Block, Jayanagar India           Bangalore     Karnataka
Prestige owen, M.G Road                  Bangalore     Karnataka
Opp: Reliance trend, Mantri Mall,-Delhi  New Delhi     New Delhi

基本上我想标记(作为新列)“地址”栏下“实体”栏下的实体所在的行,

预期产出:

Address                             District    State     Dist_match 
132, 1st block, Mysore,Karnataka    Mysore      Karnataka  TRUE
24, 4th Block, Jayanagar India      Bangalore   Karnataka  FALSE
Prestige owen, M.G Road             Bangalore   Karnataka  FALSE
Opp: Reliance trend, 
Mantri Mall,-Delhi                 New Delhi    New Delhi  TRUE

我尝试了以下但是收到了警告并且效果不佳

 df$Dist_match  <- mapply(grepl, pattern=df$District, x=df$Address)

0 个答案:

没有答案