Question

我正在处理三维numpy数组中存在的一些大体积图像数据。我将用两个小的1D阵列解释我的任务。我有一张图片：

img = [5, 6, 70, 80, 3, 4, 80, 90]

和该图像的一个分段和标记版本：

labels = [0, 0, 1, 1, 0, 0, 2, 2]

labels中的每个数字代表img中的一个对象。两个阵列都具有相同的尺寸。所以在这个例子中，img中有两个对象：

[5,6， 70 ， 80 ，3,4， 80 ， 90 ]

我现在要做的是找到每个对象的最大值的位置，在这种情况下是3和7。目前我遍历所有标签，创建img版本，其中只包含与当前标签对应的对象，并查找最大值：

for label in range(1, num_labels + 1):
    imgcp = np.copy(img)
    imgcp[labels != label] = 0
    max_pos = np.argmax(imgcp)
    max_coords = np.unravel_index(pos, imgcp.shape)

这种方法的一个问题是在每一步中复制img都会产生内存错误。我觉得内存管理应该阻止这种情况，但是有更高效的内存和可能更快的方式来完成这项任务吗？

Answer 1

以下是使用argpartition的方法。

# small 2d example
>>> data = np.array([[0,1,4,0,0,2,1,0],[0,4,1,3,0,0,0,0]])
>>> segments = np.array([[0,1,1,0,0,2,2,0],[0,1,1,1,0,0,0,0]])
>>> 
# discard zeros
>>> nz = np.where(segments)
>>> segc = segments[nz]
>>> dac = data[nz]

# count object sizes
>>> cnts = np.bincount(segc)
>>> bnds = np.cumsum(cnts)
# use counts to partition into objects
>>> idx = segc.argpartition(bnds[1:-1])
>>> dai = dac[idx]
# find maxima per object
>>> mx = np.maximum.reduceat(dai, bnds[:-1])
# find their positions
>>> am, = np.where(dai==mx.repeat(cnts[1:]))
# translate positions back to coordinate space
>>> im = idx[am]
>>> am = *(n[im] for n in nz),
>>> 
>>> 
# result
# coordinates, note that there are more points than objects because
# the maximum 4 occurs twice in object 1
>>> am
(array([1, 0, 0]), array([1, 2, 5]))
# maxima
>>> data[am]
array([4, 4, 2])
# labels
>>> segments[am]
array([1, 1, 2])

在大型3D Numpy阵列中查找局部最大值

1 个答案: