Question

我想要tarin ANN来查找RGB图片的平均.jpg？

我无法对此问题进行分类。在我看来，这不是classification问题，因为我没有任何课程/标签可以参加，而且这不是segmentation问题，因为我不想找到任何“区域”或图像中的“类”。这是一个什么样的问题？

在我看来，输入应该是.jpg图像和地面真实.png图像，平均值为RGB。输出应该是3个神经元（图像的红色，绿色和蓝色平均颜色），并且损失层应该是线性类型，因为RGB在0-255的范围内。我很感激你的帮助。感谢。

更新

在检查和训练NN的少数变体后，我的结论是这不是Regression问题。我将尝试使用由彩色图像组成的数据集full conv net及其原始版本作为地面真实图像

Answer 1

这里的技巧似乎是存储训练标签：每个输入图像有三个浮动标签。最直接的方式（恕我直言）是使用"HDF5Data"图层来存储图像和RGB目标标签。有关示例，请参阅this post。

获得数据后，您就可以拥有这个简单的网络

layer {
  name: "input"
  type: "HDF5Data"
  top: "data"
  top: "bgr_label"
  # ...
}
layer {
  name: "fc"
  type: "InnerProduct"
  bottom: "data"
  top: "bgr_mean"
  inner_product_param {
    num_output: 3 # predict BGR values
    bias_term: 0  # do not subtract mean, no need for bias in this task
    weight_filler { type: "constant" value: 1.5e-5 } # set the value to 1/num_pixels
  }
  # ... set learning rate and regularization multipliers
}
layer {
  name: "loss" 
  type: "EuclideanLoss"
  bottom: "bgr_mean"
  bottom: "bgr_label"
  top: "loss"
}

什么是NN类型，用于查找图像平均颜色（RGB）

1 个答案: