Question

我从Google或github上什么都没得到。

到目前为止，我必须将形状为[N，C，H，W]的两个Blob切片为形状为[N，1，H，W]的2 * C个Blob，并将新的Blob置换为形状为[N， H，W，1]，然后在新Blob上以内核大小= 1进行池化。然后连接到[N，H，W，C]并最终置换为[N，C，H，W]。

有什么好的渠道明智的池化实现吗？

Answer 1

对我来说，这听起来像不是通道级的池化（必须产生单个输出通道），而是element-wise MAX操作：

layer {
  name: "input_1"
  type: "Input"
  top: "input_1"
  input_param {
    shape {
      dim: 1
      dim: 2
      dim: 3
      dim: 3
    }
  }
}

layer {
  name: "input_2"
  type: "Input"
  top: "input_2"
  input_param {
    shape {
      dim: 1
      dim: 2
      dim: 3
      dim: 3
    }
  }  
}

layer {
   name: "channel_max"
   type: "Eltwise"
   bottom: "input_1"
   bottom: "input_2"
   top: "channel_max"
   eltwise_param {
      operation: MAX
   }
}

以下代码：

import caffe
import numpy as np


caffe.set_mode_cpu()
net = caffe.Net('net.prototxt', 1)

net.blobs['input_1'].data[...] = np.random.randint(10, size=(1, 2, 3, 3))
net.blobs['input_2'].data[...] = np.random.randint(10, size=(1, 2, 3, 3))

net.forward()
print('Blob #1:')
print(net.blobs['input_1'].data)
print('Blob #2:')
print(net.blobs['input_2'].data)

print('Result:')
print(net.blobs['channel_max'].data)

将两个blob合并为一个，并用相同数量的通道填充特征图的最大值：

Blob #1:
[[[[5. 6. 5.]
   [1. 6. 1.]
   [4. 7. 6.]]

  [[9. 8. 3.]
   [8. 8. 8.]
   [6. 9. 9.]]]]
Blob #2:
[[[[4. 1. 1.]
   [2. 1. 3.]
   [6. 1. 0.]]

  [[3. 8. 7.]
   [8. 2. 4.]
   [2. 8. 1.]]]]
Result:
[[[[5. 6. 5.]
   [2. 6. 3.]
   [6. 7. 6.]]

  [[9. 8. 7.]
   [8. 8. 8.]
   [6. 9. 9.]]]]

使用caffe进行任何通道明智的池化实现吗？

1 个答案: