Question

我正在努力让这段代码尽可能快地运行，而且目前效率非常低。

我有一个标量数据的4D矩阵。 4个维度对应于纬度，经度，高度和时间。数据存储在一个numpy数组中，其形状为（5,5,30,2）。

在4个不同的列表中，我保留每个轴的“map”，存储与每个索引对应的值。例如，地图数组可能如下所示：

mapLatitude = [45.,45.2,45.4,45.6,45.8]
mapLongitude = [-10.8,-10.6,-10.4,-10.2,-10.]
mapAltitude = [0,50,100,150,...,1450]
mapTime = [1345673,1345674]

这意味着在数据矩阵中，位置0,1,3,0处的数据点对应于
Lat = 45，Lon = -10.6，Alt = 150，时间= 1345673。

现在，我需要生成一个新数组，其中包含数据矩阵中每个点的坐标。

到目前为止，这是我写的：

import numpy as np

# data = np.array([<all data>])
coordinateMatrix = [ 
   (mapLatitude[index[0]],
    mapLongitude[index[1]],
    mapAltitude[index[2]],
    mapTime[index[3]] ) for index in numpy.ndindex(data.shape) ]

这可行，但需要相当长的时间，特别是当数据矩阵的大小增加时（我需要使用带有类似（100,100,150,30）形状的矩阵）。

如果有帮助，我需要生成此coordinateMatrix以将其提供给scipy.interpolate.NearestNDInterpolator。

有关如何提高速度的任何建议？
非常感谢你！

Answer 1

如果您将列表转换为ndarray，则可以按如下方式使用广播：

coords = np.zeros((5, 5, 30, 2, 4))
coords[..., 0] = np.array(mapLatitude).reshape(5, 1, 1, 1)
coords[..., 1] = np.array(mapLongitude).reshape(1, 5, 1, 1)
coords[..., 2] = np.array(mapAltitude).reshape(1, 1, 30, 1)
coords[..., 3] = np.array(mapTime).reshape(1, 1, 1, 2)

对于更一般的输入，这样的事情应该有效：

def makeCoordinateMatrix(*coords) :
    dims = len(coords)
    coords = [np.array(a) for a in coords]
    shapes = tuple([len(a) for a in coords])
    ret = np.zeros(shapes + (dims,))
    for j, a in enumerate(coords) :
        ret[..., j] = a.reshape((len(a),) + (1,) * (dims - j - 1))
    return ret

coordinateMatrix = makeCoordinateMatrix(mapLatitude, mapLongitude,
                                        mapAltitude, mapTime)

Python：通过迭代加速矩阵坐标映射

1 个答案: