Question

我正在尝试使用Numpy / Scipy将3D大气数据从一个垂直坐标插值到另一个垂直坐标。例如，我有温度和相对湿度的立方体，两者都在恒定的常规压力表面上。我想将相对湿度插值到恒温表面。

我之前已经问过我要解决的确切问题{{3}}，但是，解决方案非常缓慢。在我的情况下，我的立方体中有大约3M点（30x321x321），并且该方法在一组数据上运行大约需要4分钟。

这篇文章差不多有5年了。 Numpy / Scipy的新版本可能有更快的处理方法吗？也许看看问题的新眼睛有更好的方法吗？我愿意接受建议。

编辑：

对于一组数据立方体，

慢= 4分钟。我不确定我还能如何量化它。

正在使用的代码......

def interpLevel(grid,value,data,interp='linear'):
    """
    Interpolate 3d data to a common z coordinate.

    Can be used to calculate the wind/pv/whatsoever values for a common
    potential temperature / pressure level.

    grid : numpy.ndarray
       The grid. For example the potential temperature values for the whole 3d
       grid.

    value : float
       The common value in the grid, to which the data shall be interpolated.
       For example, 350.0

    data : numpy.ndarray
       The data which shall be interpolated. For example, the PV values for
       the whole 3d grid.

    kind : str
       This indicates which kind of interpolation will be done. It is directly
       passed on to scipy.interpolate.interp1d().

    returns : numpy.ndarray
       A 2d array containing the *data* values at *value*.

    """
    ret = np.zeros_like(data[0,:,:])
    for yIdx in xrange(grid.shape[1]):
        for xIdx in xrange(grid.shape[2]):
            # check if we need to flip the column
            if grid[0,yIdx,xIdx] > grid[-1,yIdx,xIdx]:
                ind = -1
            else:
                ind = 1
            f = interpolate.interp1d(grid[::ind,yIdx,xIdx], \
                data[::ind,yIdx,xIdx], \
                kind=interp)
            ret[yIdx,xIdx] = f(value)
    return ret

编辑2：如果有人对我正在使用的内容感兴趣，我可以共享样本数据的npy转储。

Answer 1

由于这是大气数据，我想你的网格没有均匀的间距;但是如果你的网格是直线的（这样每个垂直列都有相同的z坐标集）那么你有一些选择。

例如，如果您只需要线性插值（比如简单的可视化），您可以执行以下操作：

# Find nearest grid point
idx = grid[:,0,0].searchsorted(value)
upper = grid[idx,0,0]
lower = grid[idx - 1, 0, 0]
s = (value - lower) / (upper - lower)
result = (1-s) * data[idx - 1, :, :] + s * data[idx, :, :]

（当然，你需要添加value超出范围的检查。）对于你的大小的网格，这将非常快（如在很短的一秒钟内）

如果需要，你可以很容易地修改上面的内容来执行三次插值;挑战在于为非均匀垂直间距选择正确的权重。

使用scipy.ndimage.map_coordinates的问题在于，尽管它提供了更高阶的插值并且可以处理任意采样点，但它确实假设输入数据是均匀间隔的。它仍然会产生平滑的结果，但它不是一个可靠的近似值。

如果您的坐标网格不是直线的，那么给定索引的z值会因不同的x和y索引而变化，那么您现在使用的方法可能是您可以获得的最佳方法，而无需进行相当多的分析。你的特殊问题。

更新：

一个巧妙的技巧（同样，假设每列具有相同的，不一定是常规的坐标）是使用interp1d来提取权重，如下所示：

NZ = grid.shape[0]
zs = grid[:,0,0]
ident = np.identity(NZ)
weight_func = interp1d(zs, ident, 'cubic')

每个网格只需执行上述操作一次;只要垂直坐标不变，你甚至可以重复使用weight_func。

当需要进行插值时，weight_func(value)将为您提供权重，您可以使用这些权重计算(x_idx, y_idx)处的单个插值：

weights = weight_func(value)
interp_val = np.dot(data[:, x_idx, y_idx), weights)

如果要计算插值的整个平面，可以使用np.inner，但是因为你的z坐标是第一个，所以你需要这样做：

result = np.inner(data.T, weights).T

同样，计算应该是实际上立即的。

Numpy / Scipy中大气数据的快速3D插值

1 个答案: