Question

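I have a numpy array of points with shape (1000, 3), where each row along axis 1 has the form [x, y, 1].

The points lie at discrete values on a grid, so an example array looks like this: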

array=([1,2,1],[4,5,1],[2,3,1],...,[xN,yN,1])

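I want to expand this 2D array. By that I mean: for every [x, y, 1] coordinate in the array, if [x±1, y±1, 1] is not already in the array, append it.

At the moment I am doing this with the following code: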

import numpy as np

# Shift every point by each of the 8 neighbour offsets and stack the results.
x, y, w = array.T
array = np.concatenate([
    array,
    np.column_stack([x,     y + 1, w]),
    np.column_stack([x + 1, y,     w]),
    np.column_stack([x + 1, y + 1, w]),
    np.column_stack([x - 1, y,     w]),
    np.column_stack([x,     y - 1, w]),
    np.column_stack([x - 1, y - 1, w]),
    np.column_stack([x + 1, y - 1, w]),
    np.column_stack([x - 1, y + 1, w]),
])

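I then use np.unique(array) to reduce to the unique elements. This approach works, but it is too slow on large arrays of more than 100000 points, and it does not feel like a clean solution. There must be a way to do this without duplicating so many points and then having to find all the unique instances. Is there a different (read: faster) way to do what I am doing?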

Answer 1

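A grid of 2000 x 4000 x 200 is still small enough for a lookup table. With just under a million coordinates, I get a speedup of about 5x compared to the np.unique approach.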

lookup table:  2.18715, np.unique: 11.40247
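For reference, the np.unique baseline mentioned above can be sketched for the 2D [x, y, 1] points from the question. This is a minimal illustration, not the code that was timed: the function name `expand_unique` and the sample points are made up for the example.

```python
import numpy as np

def expand_unique(points):
    """Add the 8 grid neighbours of every [x, y, 1] point, then deduplicate."""
    points = np.asarray(points)
    # All 9 offsets (dx, dy) in {-1, 0, 1}^2, with a zero third column so the
    # homogeneous coordinate stays 1.
    d = (-1, 0, 1)
    offsets = np.array([(dx, dy, 0) for dx in d for dy in d])
    # Broadcast (N, 1, 3) + (9, 3) -> (N, 9, 3), then flatten to rows.
    candidates = (points[:, None, :] + offsets).reshape(-1, 3)
    # axis=0 deduplicates whole rows; without it np.unique flattens the array.
    return np.unique(candidates, axis=0)

pts = np.array([[1, 2, 1], [2, 3, 1]])  # hypothetical sample input
expanded = expand_unique(pts)
```

The cost here is dominated by sorting nine times as many rows as there are input points, which is the work the lookup table avoids.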

Benchmark code:

import numpy as np
from numpy.lib.stride_tricks import as_strided
from time import time

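# Draw up to one million distinct cells from a 2000 x 4000 x 200 grid
# and unravel the flat indices into (x, y, z) coordinates.
coords = np.unique(np.random.randint(0, 2000*4000*200, (1000000,)))
coords = np.c_[coords // (4000*200), (coords // 200) % 4000, coords % 200]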

t = [time()]

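# Padded workspace: one uint8 cell per grid position plus a one-cell border (about 1.6 GB).
ws = np.empty((2002, 4002, 202), dtype=np.uint8)
# View it as overlapping 3x3x3 windows, one per grid position.
ws = as_strided(ws, (2000, 4000, 200, 3, 3, 3), 2 * ws.strides)

# Stamp each point's window with the labels 0..26; overlapping windows overwrite each other.
ws[tuple(coords.T)] = np.arange(27).reshape(3, 3, 3)
# A cell that still holds a window's own label belongs to that window, so each cell is kept once.
unq = ws[tuple(coords.T)] == np.arange(27).reshape(3, 3, 3)
result = (coords[:, None, None, None, :] + np.moveaxis(np.indices((3, 3, 3)) - 1, 0, -1))[unq]
del ws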

t.append(time())

result2 = np.unique((coords[:, None, None, None, :] + np.moveaxis(np.indices((3, 3, 3)) - 1, 0, -1)).reshape(-1, 3), axis = 0)

t.append(time())

print('lookup table: {:8.5f}, np.unique: {:8.5f}'.format(*np.diff(t)))
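The benchmark above works on 3D coordinates. For the 2D [x, y, 1] points in the question, the same lookup-table idea can be sketched with a plain boolean occupancy grid, which is a simpler variant than the as_strided window trick. The function name `expand_lookup` is hypothetical, and the sketch assumes integer coordinates whose bounding box fits comfortably in memory.

```python
import numpy as np

def expand_lookup(points):
    """Expand [x, y, 1] points to their 3x3 neighbourhoods via a boolean grid."""
    points = np.asarray(points)
    xy = points[:, :2]
    # Shift so the smallest coordinate maps to index 1, leaving a one-cell border.
    lo = xy.min(axis=0) - 1
    shape = tuple(xy.max(axis=0) - lo + 2)
    grid = np.zeros(shape, dtype=bool)
    ix, iy = (xy - lo).T
    # Mark each point and its 8 neighbours; repeated hits just set True again.
    for dx in (-1, 0, 1):
        for dy in (-1, 0, 1):
            grid[ix + dx, iy + dy] = True
    # Read back the occupied cells and undo the shift.
    out = np.argwhere(grid) + lo
    return np.column_stack([out, np.ones(len(out), dtype=out.dtype)])

pts = np.array([[1, 2, 1], [2, 3, 1]])  # hypothetical sample input
expanded = expand_lookup(pts)
```

No sorting or deduplication step is needed because a grid cell can only be True once. The trade-off is that memory scales with the area of the bounding box rather than with the number of points.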
