增加主题模型输出的可能性

时间:2018-08-01 08:33:21

标签: python lda topic-modeling

我刚遇到以下主题模型文章:https://medium.com/ml2vec/topic-modeling-is-an-unsupervised-learning-approach-to-clustering-documents-to-discover-topics-fdfbf30e27df

该脚本包含一个函数,该函数将各个主题模型和相应的单词放入表格中:

def get_lda_topics(model, num_topics):
    word_dict = {};
    for i in range(num_topics):
        words = model.show_topic(i, topn = 20);
        word_dict['Topic # ' + '{:02d}'.format(i+1)] = [i[0] for i in words];
    return pd.DataFrame(word_dict);

问题:是否可以将单词的LDA概率添加到表输出中?

Topic 1
car (0.050)
bike (0.035)
skateboard (0.020)
...

0 个答案:

没有答案