我是tensorflow的新手,我查看了教程并成功执行了它们。 现在我必须解决一个问题,我的输出不应该像MNIST Labels(1-10)这样的分类。我想计算图像中的对象,因此我只需要一个数字输出值,因为结果可以在0-300 +之间的范围内,因此在单热矢量中编码的结果不适用。
我的代码在下面(大部分是从MNIST tut复制的)如果我有多个类并且标签是在一个热矢量中编码的话它工作正常。
我认为我必须调整的是成本函数:
成本= tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(prediction,y))
但我不知道该怎么做。请有人帮帮我 所以预测返回一个值,y(地面实况)也是一个值,例如。[5]对于5个物体。
### CNN CONFIG
n_classes = 1
batch_size = 100
x = tf.placeholder('float', [None, 16384]) # 128*128 = 16384 28*28 = 784
y = tf.placeholder('float')
keep_rate = 0.8
keep_prob = tf.placeholder(tf.float32)
def conv2d(x, W):
return tf.nn.conv2d(x, W, strides=[1, 1, 1, 1], padding='SAME')
def maxpool2d(x):
# size of window movement of window
return tf.nn.max_pool(x, ksize=[1, 2, 2, 1], strides=[1, 2, 2, 1], padding='SAME')
def convolutional_neural_network(x):
weights = {'W_conv1': tf.Variable(tf.random_normal([5, 5, 1, 32])),
'W_conv2': tf.Variable(tf.random_normal([5, 5, 32, 64])),
'W_fc': tf.Variable(tf.random_normal([32 * 32 * 64, 1024])),
'out': tf.Variable(tf.random_normal([1024, n_classes]))}
biases = {'b_conv1': tf.Variable(tf.random_normal([32])),
'b_conv2': tf.Variable(tf.random_normal([64])),
'b_fc': tf.Variable(tf.random_normal([1024])),
'out': tf.Variable(tf.random_normal([n_classes]))}
x = tf.reshape(x, shape=[-1, 128, 128, 1])
conv1 = tf.nn.relu(conv2d(x, weights['W_conv1']) + biases['b_conv1'])
conv1 = maxpool2d(conv1)
conv2 = tf.nn.relu(conv2d(conv1, weights['W_conv2']) + biases['b_conv2'])
conv2 = maxpool2d(conv2)
#fc = tf.reshape(conv2, [-1, 7 * 7 * 64])
fc = tf.reshape(conv2, [-1, 32 * 32 * 64])
fc = tf.nn.relu(tf.matmul(fc, weights['W_fc']) + biases['b_fc'])
#fc = tf.nn.dropout(fc, keep_rate)
output = tf.matmul(fc, weights['out']) + biases['out']
return output
def train_neural_network(x):
prediction = convolutional_neural_network(x)
cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(prediction, y))
optimizer = tf.train.AdamOptimizer().minimize(cost)
#saver
saver = tf.train.Saver()
hm_epochs = 50
with tf.Session() as sess:
sess.run(tf.initialize_all_variables())
for epoch in range(hm_epochs):
epoch_loss = 0
logging.debug('Epoch: ' + str(epoch) +' started')
for i in range(int(len(train_database['images']) / batch_size)):
epoch_x, epoch_y = getNextBatch(train_database, (i + 1) * batch_size)
_, c = sess.run([optimizer, cost], feed_dict={x: epoch_x, y: epoch_y})
epoch_loss += c
print('Epoch', epoch, 'completed out of', hm_epochs, 'loss:', epoch_loss)
答案 0 :(得分:2)
标准方法是使用均方误差:
cost = tf.reduce_mean(tf.square(prediction - y))