Question

我正在尝试训练一个需要4通道输入的转换网络，并希望使用像VGG16这样的预训练模型。有意义的是，我不应该使用VGG16中的初始转换块，因为它们经过了3通道输入的训练，并重新定义了初始转换块。

但是，我想从VGG16开始使用block3。如何使用Tensorflow Keras api实现此目标？

简而言之，我该如何从预训练模型的特定图层复制权重。我正在使用tensorflow 2.0 alpha版本。

Answer 1

执行此操作的快速方法是创建一个新模型，该模型将您的自定义输入与VGG16的最后一层结合在一起。找到您要保留的第一个VGG16层的索引，并将其连接到新创建的输入。然后，手动连接下面的每个VGG16层，以重新创建VGG16段。您可以一路冻结VGG16层。

from tensorflow.keras.applications.vgg16 import VGG16
from tensorflow.keras.preprocessing import image
from tensorflow.keras.applications.vgg16 import preprocess_input
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, Conv2D

vgg16 = VGG16()

# Find the index of the first block3 layer
for index in range(len(vgg16.layers)):
    if 'block3' in vgg16.layers[index].name:
        break

# Add your own input
model_input = Input(shape=(224,224,4), name='new_input')
x = Conv2D(...)(model_input)
...

# Connect your last layer to the VGG16 model, starting at the "block3" layer
# Then, you need to connect every layer manually in a for-loop, freezing each layer along the way

for i in range(index, len(vgg16.layers)):
  # freeze the VGG16 layer
  vgg16.layers[i].trainable = False  

  # connect the layer
  x = vgg16.layers[i](x)

model_output = x
newModel = Model(model_input, model_output)

还要确保自定义图层的输出与block3图层期望作为输入的形状相匹配。

如何使用Tensorflow Keras API从预训练的模型复制特定的图层权重？

1 个答案: