Question

我有一个卷积神经网络，我最近重构使用Tensorflow的Estimator API，很大程度上遵循this tutorial。但是，在训练期间，我添加到EstimatorSpec的指标未显示在Tensorboard上，并且似乎没有在tfdbg中进行评估，尽管图表中的名称范围和指标存在于写入Tensorboard的图表中。

model_fn的相关位如下：

 ...

 predictions = tf.placeholder(tf.float32, [num_classes], name="predictions")

 ...

 with tf.name_scope("metrics"):
    predictions_rounded = tf.round(predictions)
    accuracy = tf.metrics.accuracy(input_y, predictions_rounded, name='accuracy')
    precision = tf.metrics.precision(input_y, predictions_rounded, name='precision')
    recall = tf.metrics.recall(input_y, predictions_rounded, name='recall')

if mode == tf.estimator.ModeKeys.PREDICT:
    spec = tf.estimator.EstimatorSpec(mode=mode,
                                      predictions=predictions)
elif mode == tf.estimator.ModeKeys.TRAIN:

    ...

    # if we're doing softmax vs sigmoid, we have different metrics
    if cross_entropy == CrossEntropyType.SOFTMAX:
        metrics = {
            'accuracy': accuracy,
            'precision': precision,
            'recall': recall
        }
    elif cross_entropy == CrossEntropyType.SIGMOID:
        metrics = {
            'precision': precision,
            'recall': recall
        }
    else:
        raise NotImplementedError("Unrecognized cross entropy function: {}\t Available types are: SOFTMAX, SIGMOID".format(cross_entropy))
    spec = tf.estimator.EstimatorSpec(mode=mode,
                                      loss=loss,
                                      train_op=train_op,
                                      eval_metric_ops=metrics)
else:
    raise NotImplementedError('ModeKey provided is not supported: {}'.format(mode))

return spec

任何人都有任何想法为什么不写这些？我正在使用Tensorflow 1.7和Python 3.5。我尝试通过tf.summary.scalar显式添加它们，虽然它们以这种方式进入Tensorboard，但在第一次通过图表后它们永远不会更新。

Answer 1

指标API有一个扭曲，让我们以tf.metrics.accuracy为例（所有tf.metrics.*的工作方式相同）。这会返回2个值，accuracy指标和upate_op，这看起来像是您的第一个错误。你应该有这样的东西：

accuracy, update_op = tf.metrics.accuracy(input_y, predictions_rounded, name='accuracy')

accuracy只是您希望计算的值，但请注意，您可能希望计算对sess.run的多次调用的准确性，例如，当您计算大型精度时测试集并非全部适合内存。这就是update_op的来源，它会产生结果，因此当您要求accuracy时，它会为您提供一个正在运行的记录。

update_op没有依赖关系，因此您需要在sess.run中显式运行它或添加依赖项。例如，您可以将其设置为依赖于成本函数，以便在计算成本函数时计算update_op（导致运行计数以更新准确性）：

with tf.control_dependencies(cost):
  tf.group(update_op, other_update_ops, ...)

您可以使用局部变量初始值设定项重置指标的值：

sess.run(tf.local_variables_initializer())

如你所提到的那样，你需要使用tf.summary.scalar(accuracy)为张量板添加准确性（虽然看起来你添加了错误的东西）。

自定义Estimator

1 个答案: