我正在尝试使用CNN使用keras对python中的医学图像进行分类。这些医学图像还包括可能影响模型决策的文本信息,例如年龄和性别。如何训练可以同时使用图像和真实世界信息进行训练的CNN,以便可以对两者进行分类。
答案 0 :(得分:1)
我可以想到几种可能性,但是最简单的方法是使用CNN从医学图像中提取一些特征,然后将CNN的结果展平,并合并非CNN的结果。图像数据。假设您有512x512图像和10个类,这是一个想法。这是功能性API,可让您有多个输入。
import tensorflow as tf
import numpy as np
num_classes = 10
H,W = 512, 512
# Define inputs with their shapes
imgs = tf.keras.Input((H,W,3), dtype = tf.float32)
genders = tf.keras.Input(1, dtype = tf.float32)
ages = tf.keras.Input(1, dtype = tf.float32)
# Extract image features
features = tf.keras.layers.Conv2D(64, 4, strides = 4, activation = 'relu')(imgs)
features = tf.keras.layers.MaxPooling2D()(features)
features = tf.keras.layers.Conv2D(128,3, strides = 2, activation = 'relu')(features)
features = tf.keras.layers.MaxPooling2D()(features)
features = tf.keras.layers.Conv2D(256, 3, strides = 2, activation = 'relu')(features)
features = tf.keras.layers.Conv2D(512, 3, strides = 2, activation = 'relu')(features)
# #Flatten output
flat_features = tf.keras.layers.Flatten()(features)
#Concatenate gender and age
flat_features = tf.concat([flat_features, genders, ages], -1)
# Downsample
xx = tf.keras.layers.Dense(2048, activation = 'relu')(flat_features)
xx = tf.keras.layers.Dense(1024, activation = 'relu')(xx)
xx = tf.keras.layers.Dense(512, activation = 'relu')(xx)
#Calculate probabilities for each class
logits = tf.keras.layers.Dense(num_classes)(xx)
probs = tf.keras.layers.Softmax()(logits)
model = tf.keras.Model(inputs = [imgs, genders, ages], outputs = probs)
model.summary()
这种体系结构不是特别标准,您可能希望使解码器更深和/或减少CNN编码器中的参数数量。