Question

我已经使用本教程http://ai.intelligentonlinetools.com/ml/k-means-clustering-example-word2vec/了解了使用NLTK软件包部分的K-clustering。

The code for NLTK K-clustering is like this :
X = model[model.wv.vocab]
Cluster = 2
Kcluster = KMeansClusterer(Cluster,distance = nltk.cluster.util.cosine_distance,repeats = 25 )
ApplyCluster = Kcluster.cluster(model[abc.vocab], assign_clusters=True)

所以为此我想要想象我们的结果，但我不知道。我试过plot.scatter（）但是我有一些问题需要定义要采用的参数。

Answer 1

我的建议：

尝试2种尺寸的各种组合。
使用PCA转换数据，然后选择前2个主要组件。
t-sne是一种可视化的降维技术。

编辑：您可能会看here绘制想法（技术性而非数学性）
EDIT 2 ：正如@Dan所提到的，基于相似度矩阵和修改后的算法，t-sne是一种很好的可视化降维技术（因此 t -sne这样做非常好。

Python可视化K聚类

1 个答案: