我正在尝试在迷你批处理k均值实现(使用python)上测试来自scikit-learn的真实数据集,但这给了我错误![我没有更改代码中的任何内容,只是数据集的名称]玩具数据集使代码正确运行
注意:fetch_olivetti_faces使用相同的代码即可正常工作,但不能使用任何其他数据集
from sklearn.datasets import fetch_20newsgroups_vectorized
boston =fetch_20newsgroups_vectorized()
我遇到了这个错误
Traceback (most recent call last):
File "C:/Users/User/PycharmProjects/untitled/venv/Scripts/mbkm.py", line 18, in <module>
boston =fetch_20newsgroups_vectorized()
File "C:\Users\User\PycharmProjects\untitled\venv\lib\site-packages\sklearn\datasets\twenty_newsgroups.py", line 406, in fetch_20newsgroups_vectorized
X_train, X_test = _joblib.load(target_file)
File "C:\Users\User\PycharmProjects\untitled\venv\lib\site-packages\sklearn\externals\joblib\numpy_pickle.py", line 598, in load
obj = _unpickle(fobj, filename, mmap_mode)
File "C:\Users\User\PycharmProjects\untitled\venv\lib\site-packages\sklearn\externals\joblib\numpy_pickle.py", line 526, in _unpickle
obj = unpickler.load()
.
.
raise ValueError(msg % (error_template, size, len(data)))
ValueError: EOF: reading array data, expected 45260 bytes got 23049