使用keras中的grid_search和flow_from_directory

时间:2019-04-06 19:51:37

标签: python keras scikit-learn grid-search hyperparameters

我试图优化我的角膜CNN中用于图像分类的超参数。我决定使用sklearn的网格搜索。我克服了用keras flow_from_directory制作x和y的基本困难,但仍然不起作用。 最后一行错误 ValueError: dropout is not a legal parameter

def grid_model(optimizer='adam',
                     kernel_initializer='random_uniform', 
                     dropout=0.2,
                    loss='categorical_crossentropy'):

    model = Sequential()

    model.add(Conv2D(6,(5,5),activation="relu",padding="same",
                     input_shape=(img_width, img_height, 3)))
    model.add(MaxPooling2D((2,2)))
    model.add(Dropout(dropout))

    model.add(Conv2D(16,(5,5),activation="relu"))
    model.add(MaxPooling2D((2,2)))
    model.add(Dropout(dropout))

    model.add(Flatten())

    model.add(Dense(120, activation='relu', kernel_initializer=kernel_initializer))
    model.add(Dropout(dropout))
    model.add(Dense(84, activation='relu', kernel_initializer=kernel_initializer))
    model.add(Dropout(dropout))
    model.add(Dense(10, activation='softmax'))

    model.compile(loss=loss,
                optimizer=optimizer,
                metrics=['accuracy'])
    return model

train_generator = ImageDataGenerator(rescale=1/255)

validation_generator = ImageDataGenerator(rescale=1/255)

# Retrieve images and their classes for train and validation sets
train_flow = train_generator.flow_from_directory(directory=train_data_dir, 
                                                 batch_size=batch_size, 
                                                 target_size=(img_height,img_width))

validation_flow = validation_generator.flow_from_directory(directory=validation_data_dir, 
                                                           batch_size=batch_size,
                                                           target_size=(img_height,img_width),
                                                            shuffle = False)

clf = KerasClassifier(build_fn=grid_model(), epochs=epochs, verbose=0)

param_grid = {
    'clf__optimizer':['adam', 'Nadam'],
    'clf__epochs':[100, 200],
    'clf__dropout':[0.1, 0.2, 0.5],
    'clf__kernel_initializer':['normal','uniform'],
    'clf__loss':['categorical_crossentropy', 
                     'sparse_categorical_crossentropy', 
                     'kullback_leibler_divergence']
}

pipeline = Pipeline([('clf',clf)])

(X_train, Y_train) = train_flow.next()

grid = GridSearchCV(pipeline, cv=2, param_grid=param_grid)
grid.fit(X_train, Y_train)

1 个答案:

答案 0 :(得分:1)

问题出在这一行:

clf = KerasClassifier(build_fn=grid_model(), epochs=epochs, verbose=0)

将其更改为

clf = KerasClassifier(build_fn=grid_model, epochs=epochs, verbose=0)

不应调用grid_model方法,而应传递对其的引用。

此外,在损失列表中,不能使用'sparse_categorical_crossentropy'(整数),因为模型所需的输出形状与'categorical_crossentropy'(一次性)的形状不兼容。