Searching for words within sentences in R

Time: 2015-03-03 09:21:43

Tags: regex r

I would like to ask for advice on the following. I have a data frame:

reviews <- data.frame(value = c("Product was received in excellent condition. Made with high quality materials. Very Good product",
                            "Inexpensive. An improvement over integrated graphics.",
                            "I love that product so excite. I will order again if I need more .",
                            "Excellent card, great graphics."),
                            user = c(1,2,3,4),
                            Review_Id = c("101968","101968","210546","112546"))

I also have the topics extracted from each of the sentences above:

topics <- data.frame(topic = c("product","condition","materials","product","integrated graphics","product","card","graphics"),
                 user = c(1,1,1,1,2,3,4,4), Review_Id = c("101968","101968","101968","101968","101968","210546","112546","112546"))

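Given the sentences and the topics for each user and Review_Id, I need to find the original sentence in which each topic appears, and then write that sentence into a column named review.

The desired output should look like this: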

                topic  user  Review_Id  review
              product     1     101968  Product was received in excellent condition.
            condition     1     101968  Product was received in excellent condition.
            materials     1     101968  Made with high quality materials.
              product     1     101968  Very Good product
  integrated graphics     2     101968  An improvement over integrated graphics.
              product     3     210546  I love that product so excite.
                 card     4     112546  Excellent card, great graphics.
             graphics     4     112546  Excellent card, great graphics.

Any suggestions or approaches are much appreciated. Thanks in advance.

1 answer:

Answer 0: (score: 0)

You could try

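# Join on both keys: Review_Id alone is not unique in reviews (users 1 and 2 share "101968").
merge.data.frame(x = topics, y = reviews, by = c("user", "Review_Id"), all.x = TRUE, all.y = FALSE)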
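
A merge like this attaches the whole review text to each topic row; it does not pick out the individual sentence. The following is a minimal base R sketch of one way to do that, not a definitive solution. It assumes sentences end in `.`, `!` or `?` followed by whitespace, that a topic matches by case-insensitive substring, and that when the same topic is listed several times for one review, the k-th listing belongs to the k-th sentence containing it. The helper names `split_sentences` and `find_sentence` and the `occurrence` vector are made up for illustration.

```r
reviews <- data.frame(
  value = c("Product was received in excellent condition. Made with high quality materials. Very Good product",
            "Inexpensive. An improvement over integrated graphics.",
            "I love that product so excite. I will order again if I need more .",
            "Excellent card, great graphics."),
  user = c(1, 2, 3, 4),
  Review_Id = c("101968", "101968", "210546", "112546"),
  stringsAsFactors = FALSE
)

topics <- data.frame(
  topic = c("product", "condition", "materials", "product",
            "integrated graphics", "product", "card", "graphics"),
  user = c(1, 1, 1, 1, 2, 3, 4, 4),
  Review_Id = c("101968", "101968", "101968", "101968",
                "101968", "210546", "112546", "112546"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: split a review on whitespace that follows . ! or ?
split_sentences <- function(text) {
  trimws(unlist(strsplit(text, "(?<=[.!?])\\s+", perl = TRUE)))
}

# Position of each topic row among identical (user, Review_Id, topic) rows,
# so that a second "product" can be mapped to the second matching sentence.
occurrence <- ave(seq_len(nrow(topics)),
                  topics$user, topics$Review_Id, topics$topic,
                  FUN = seq_along)

# Hypothetical helper: return the k-th sentence of the matching review that
# contains the topic (or the last matching one if there are fewer than k).
find_sentence <- function(topic, user, review_id, k) {
  text <- reviews$value[reviews$user == user & reviews$Review_Id == review_id]
  if (length(text) == 0) return(NA_character_)
  sentences <- split_sentences(text[1])
  hits <- sentences[grepl(tolower(topic), tolower(sentences), fixed = TRUE)]
  if (length(hits) == 0) return(NA_character_)
  hits[min(k, length(hits))]
}

topics$review <- mapply(find_sentence,
                        topics$topic, topics$user, topics$Review_Id, occurrence,
                        USE.NAMES = FALSE)
print(topics)
```

Substring matching is deliberately simple here: a topic such as "card" would also match "cardboard". If that matters for your data, a word-boundary regular expression may be a better fit than `fixed = TRUE`.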