The script contains a function that puts each topic of the model and its corresponding words into a table:
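import pandas as pd

def get_lda_topics(model, num_topics):
    word_dict = {}
    for i in range(num_topics):
        # show_topic returns (word, probability) pairs for topic i
        words = model.show_topic(i, topn=20)
        # keep only the word from each pair
        word_dict['Topic # ' + '{:02d}'.format(i + 1)] = [word for word, _ in words]
    return pd.DataFrame(word_dict)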
Question: is it possible to add each word's LDA probability to the table output? For example:
Topic 1
car (0.050)
bike (0.035)
skateboard (0.020)
...
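One possible approach, as a minimal sketch: gensim's show_topic already returns (word, probability) pairs, so the probability can be formatted into each cell instead of being discarded. The helper names format_lda_topics and get_lda_topics_with_probs are hypothetical, and the "word (0.050)" cell format is an assumption based on the desired output above.

```python
def format_lda_topics(model, num_topics, topn=20):
    """Map each topic label to a list of 'word (probability)' strings.

    Assumes `model` exposes show_topic(topic_id, topn=...) returning
    (word, probability) pairs, as gensim's LdaModel does.
    """
    topics = {}
    for i in range(num_topics):
        pairs = model.show_topic(i, topn=topn)
        label = "Topic # {:02d}".format(i + 1)
        # Keep the word and append its probability rounded to 3 decimals
        topics[label] = ["{} ({:.3f})".format(word, prob) for word, prob in pairs]
    return topics


def get_lda_topics_with_probs(model, num_topics, topn=20):
    """Return the formatted topics as a DataFrame, one column per topic."""
    # Imported here so the formatting helper above works without pandas
    import pandas as pd
    return pd.DataFrame(format_lda_topics(model, num_topics, topn))
```

Calling get_lda_topics_with_probs(model, num_topics) should give the same table layout as the original function, with each cell holding the word followed by its probability in parentheses. If the numbers are needed for sorting or filtering later, keeping words and probabilities in separate columns may be a better fit than a combined string.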