我有一个word2vec模型,我想使用“层次聚类”病房方法对其进行聚类,而我的代码在这里。
import gensim
import csv
import numpy as np
path = './model_terbaru/idwiki_word2vec_100.model'
model = gensim.models.word2vec.Word2Vec.load(path)
X = model[model.wv.vocab]
from sklearn.cluster import AgglomerativeClustering
cluster = AgglomerativeClustering(n_clusters=130, affinity='euclidean', linkage='ward')
assigned_clusters=cluster.fit_predict(X)
words = list(model.wv.vocab)
for i, word in enumerate(words):
try:
mylist= np.append(word,str(assigned_clusters[i]))
ok=open('hasil_cluster30.csv','a')
a=csv.writer(ok,lineterminator='\n')
a.writerows([mylist])
except:
pass
ok.close()
我的代码有错误
dm = np.empty((m * (m - 1)) // 2, dtype=np.double)
MemoryError
请帮助我。