Question

我正在寻找一种方法来使用TensorFlow专门获得张量的局部最大值的索引。

TL;博士

我不是数据科学家。我不太了解计算机视觉背后的理论，但我正在尝试使用TensorFlow构建计算机视觉应用程序。我计划保存我的模型，并使用TF服务将其称为服务，因此我不能依赖外部库，如numpy，scipy等。我想要完成的算法与scipy相同signal.argrelextrema，但是可以用我的模型保存并重新运行。其他算法已经显示here，但没有一个在TensorFlow中执行。有人能指出我正确的方向吗？

Answer 1

我认为你没有提供足够的信息来澄清。首先，我不确定你想获得Tensor的最大元素（tf中有一个函数），或者你想找到函数的局部最大值（不是Tensor）。在这种情况下，您可以恢复该功能并找到导致您正在寻找的本地最小值。

Answer 2

修改

我的第一个解决方案是功能性的，但效率低下。它需要五次张量的张量（零跟踪，反向，零跟踪，反向，其中）。我现在有一个只需要两次迭代的解决方案，而且还足够灵活，可以快速识别局部最小值...

def get_slope(prev, cur): # A: Ascending # D: Descending # P: PEAK (on previous node) # V: VALLEY (on previous node) return tf.cond(prev[0] < cur, lambda: (cur, ascending_or_valley(prev, cur)), lambda: (cur, descending_or_peak(prev, cur))) def ascending_or_valley(prev, cur): return tf.cond(tf.logical_or(tf.equal(prev[1], 'A'), tf.equal(prev[1], 'V')), lambda: np.array('A'), lambda: np.array('V')) def descending_or_peak(prev, cur): return tf.cond(tf.logical_or(tf.equal(prev[1], 'A'), tf.equal(prev[1], 'V')), lambda: np.array('P'), lambda: np.array('D')) def label_local_extrema(tens): """Return a vector of chars indicating ascending, descending, peak, or valley slopes""" # initializer element values don't matter, just the type. initializer = (np.array(0, dtype=np.float32), np.array('A')) # First, get the slope for each element slope = tf.scan(get_slope, tens, initializer) # shift by one, since each slope indicator is the slope # of the previous node (necessary to identify peaks and valleys) return slope[1][1:] def find_local_maxima(tens): """Return the indices of the local maxima of the first dimension of the tensor""" return tf.squeeze(tf.where(tf.equal(label_local_extrema(blur_x_tf), 'P')))

结束编辑

好的，我已经设法找到了解决方案，但它并不漂亮。以下函数采用1D张量，并将非局部最大值的所有点减少为零。这只适用于正数，并且需要修改float32以外的数据类型，但它符合我的需要。

但必须有更好的方法来做到这一点。

def zero_descent(prev, cur): """reduces all descent steps to zero""" return tf.cond(prev[0] < cur, lambda: (cur, cur), lambda: (cur, 0.0)) def skeletonize_1d(tens): """reduces all point other than local maxima to zero""" # initializer element values don't matter, just the type. initializer = (np.array(0, dtype=np.float32), np.array(0, dtype=np.float32)) # First, zero out the trailing side trail = tf.scan(zero_descent, tens, initializer) # Next, let's make the leading side the trailing side trail_rev = tf.reverse(trail[1], [0]) # Now zero out the leading (now trailing) side lead = tf.scan(zero_descent, trail_rev, initializer) # Finally, undo the reversal for the result return tf.reverse(lead[1], [0]) def find_local_maxima(tens): return tf.where(skeletonize_1d >0)

Answer 3

伪：

input_matrix == max_pool(input_matrix)

说明：

当输入值与max_pooling所采用的值相同时，表示输入值最大。

使用TensorFlow

3 个答案: