Question

我试图在输入文本中找到前10个常用单词与其余频繁单词之间的关联。

当我查看findAssocs()的个别输出时：

findAssocs(dtm, "good", corlimit=0.4)

通过打印“好”这个词来清楚地给出输出。

$good
 better     got    hook    next content     fit  person 
   0.44    0.44    0.44    0.44    0.43    0.43    0.43

但是当我尝试为具有前10个单词的字符向量自动执行此过程时：

t10 <- c("busi", "entertain", "topic", "interact", "track", "content", "paper", "media", "game", "good")

输出是每个元素的相关性列表，但没有关联这些元素的信息。样本输出如下（请注意，t10 [i]处的单词不打印，与上面单独的输出不同，其中'good'被清楚地打印出来）：

for(i in 1:10) {

   t10_words[i] <- as.list(findAssocs(dtm, t10[i], corlimit=0.4))
}


> t10_words
[[1]]
   littl descript  disrupt    enter    model 
    0.50     0.48     0.48     0.48     0.48 

[[2]]
  immers    anyth   effect     full holodeck      iot  problem      say startrek     such  suspect      wow 
    0.68     0.48     0.48     0.48     0.48     0.48     0.48     0.48     0.48     0.48     0.48     0.48 

[[3]]
         area        captur          give        overal          like          alon          avid         begin 
         0.51          0.47          0.47          0.47          0.44          0.43          0.43          0.43 
      circuit         cloud collaboration      communic     communiti        concis        confus         defin 
         0.43          0.43          0.43          0.43          0.43          0.43          0.43          0.43 
      discord        doesnt          drop enablesupport        esport         event         everi       everyon 
         0.43          0.43          0.43          0.43          0.43          0.43          0.43          0.43

如何打印输出以及实际的关联词？

有人可以帮我解决这个问题吗？

感谢。

Answer 1

运行for循环后，添加以下代码：

names(t10_words) <- t10

这将使用t10中指定的单词命名列表。

无法调整R

1 个答案: