Question

我正在做一个机器学习项目来识别手写数字。实际上，我只想向MNIST添加更多数据集，但我无法这样做。

我做了以下事情：

n_samples = len(mnist.data)
x = mnist.data.reshape((n_samples, -1))# array of feature of 64 pixel
y = mnist.target                         # Class label from 0-9 as there are digits

img_temp_train=cv2.imread('C:/Users/amuly/Desktop/Soap/crop/2.jpg',0)

X_train, X_test, y_train, y_test = train_test_split(x, y, test_size=0.2)

#Now I want to add the img_temp_train to my dataset for training.

X_train=np.append(X_train,img_temp_train.reshape(-1))
y_train=np.append(y_train,[4.0])

训练后的长度是：

43904784（X_train）
56001（y_train）

但两者都应该是56001。

Answer 1

试试这个：

X_train = np.append(X_train, [img_temp_train], axis=0)

你不应该在不考虑你先做的事情的情况下不顾一切地重塑事物！

此外，使用连接通常是更好的主意：

X_train = np.concatenate((X_train, [img_temp_train]), axis=0)

将数据添加到MNIST数据集

1 个答案: