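I have a list of words, their frequencies, and their sentiment from an affinity lexicon. I am trying to count and group by sentiment, but I am not sure how to include the frequency in the code. Each of the two lines below runs, but neither takes the freq column into account when totalling, and I am not sure how to do that.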
ddply(summaryLex,~sentiment,summarise,frequency=length(unique(word)))
sqldf("SELECT sentiment, COUNT(sentiment) as totalsent from summaryLex GROUP BY sentiment")
summaryLex file: https://drive.google.com/open?id=15KBebiqXsNnndOP2mzoaxnvx1nk8Z8vL
Answer 0 (score: 1)
With data.table:
data[, sum(freq), by = sentiment]
With dplyr:
data %>%
group_by(sentiment) %>%
summarise(sum = sum(freq))
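If neither package is available, the same weighted total can be sketched in base R. This is a minimal example on a made-up stand-in for summaryLex: the rows are hypothetical, and the column names word, freq and sentiment are assumed from the question rather than taken from the actual file.

```r
# Toy stand-in for summaryLex; these rows are invented for illustration only.
summaryLex <- data.frame(
  word      = c("good", "great", "bad", "awful", "fine"),
  freq      = c(3, 2, 4, 1, 5),
  sentiment = c("positive", "positive", "negative", "negative", "positive"),
  stringsAsFactors = FALSE
)

# Sum freq within each sentiment group instead of counting rows.
totals <- aggregate(freq ~ sentiment, data = summaryLex, FUN = sum)
print(totals)
```

The point in all three variants is the same: replace the row count (length or COUNT) with a sum over freq, so that each word contributes its frequency rather than 1.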