Question

我有一个拥有约190,000行的Facebook数据的csv文件。列名如下：

comment_id, status_id, parent_id, comment_message, comment_author, comment_published, comment_likes, Positive, Negative, Sentiment

我想找出评论最多的comment_author（comment_message的＃）和Sentiment > 0。

有人知道如何使用R？

应用此过滤器

Answer 1

如果df是您的数据框，您可以使用dplyr包，如下所示：

df %>% group_by(comment_author,sentiment) %>%
       dplyr::summarize(total_number_comment=sum(comment_message)) %>%
       as.data.frame() %>%
       arrange(desc(total_number_comment)) %>%
       filter(sentiment>0)

我不明白你真正想用sentiment变量做什么（例如你需要提供一个例子），但是分组部分已经完成了

使用来自csv文件的R过滤数据

1 个答案: