Question

我正在学习如何使用Keras和CIFAR-10数据集实现数据增强。我正在学习在线教程和本书Deep learning with Keras.

的帮助

代码的具体细节是here。

这是我的问题，我确信这与我的一些误解有关：

这是我的CONV设置。

 IMG_CHANNELS = 3
 IMG_ROWS = 32
 IMG_COLS = 32
 BATCH_SIZE = 128
 NB_EPOCH = 50
 NB_CLASSES = 10
 VERBOSE = 1
 VALIDATION_SPLIT = 0.2
 OPTIM = RMSprop()

加载数据集，转换为分类，浮点和规范化：

(X_train, y_train), (X_test, y_test) = cifar10.load_data()
Y_train = np_utils.to_categorical(y_train, NB_CLASSES)
Y_test = np_utils.to_categorical(y_test, NB_CLASSES) 
X_train = X_train.astype('float32')
X_test = X_test.astype('float32')
X_train /= 255
X_test /= 255

创建生成器

datagen = ImageDataGenerator(
        featurewise_center=False,  # set input mean to 0 over the dataset
        samplewise_center=False,  # set each sample mean to 0
        featurewise_std_normalization=False,  # divide inputs by std of the dataset
        samplewise_std_normalization=False,  # divide each input by its std
        zca_whitening=False,  # apply ZCA whitening
        rotation_range=0,  # randomly rotate images in the range (degrees, 0 to 180)
        width_shift_range=0.1,  # randomly shift images horizontally (fraction of total width)
        height_shift_range=0.1,  # randomly shift images vertically (fraction of total height)
        horizontal_flip=True,  # randomly flip images
        vertical_flip=False)  # randomly flip images
 datagen.fit(X_train)

训练模型（我还没有列出模型）

model.fit_generator(datagen.flow(X_train, Y_train,
                   batch_size=BATCH_SIZE),
                   samples_per_epoch=X_train.shape[0],
                   nb_epoch=NB_EPOCH, 
                   verbose=VERBOSE)

我的问题是，当我训练时，会显示以下内容：

 Epoch 1/40
 390/390 [==============================] - 199s - loss: 0.9751 - acc: 0.6588

我不明白为什么我会得到390个例子。 Samples_per_epoch等于X_train.shape [0]，即50000，批量大小为128，所以我认为它应该以128个批量上升到50000.

Answer 1

进度条未显示样本数，但显示步数或批次数（当您使用model.fit代替model.fit_generator时，它会自动显示样本）。每批包含128个样品，总共有50,000个样品。 50,000/128 = 390.625。这就是为什么你看到390而不是50,000。

由于您使用的是model.fit_generator，因此无法显示样本总数。除非您将batch_size设置为1.这样做的原因是期望生成器无限期地循环其数据，直到达到steps_per_epochs或samples_per_epoch阈值(*)

顺便说一句，您可以使用回调model.fit在ProgbarLogger中更改此内容，查看here。

数据增强期间的Keras CONV培训似乎显示错误的批量大小和培训示例的数量

1 个答案: