Question

我有一个numpy数组，我从一个形状为（930,360,720）的netCDF文件中引入，它被组织为（时间，纬度，经度）。

对于930个时间戳中的每一个的每个纬度/经度对，我需要计算该值达到或超过阈值的次数＆＃34; x＆＃34; （例如0.2或0.5等）并最终计算每个点超过阈值的百分比，然后输出结果，以便稍后绘制。

我尝试了很多方法，但这是我最近的方法：

lat_length = len(lats) 

#where lats has been defined earlier when unpacked from the netCDF dataset

lon_length = len(lons) 

#just as lats; also these were defined before using np.meshgrid(lons, lats)

for i in range(0, lat_length):
     for j in range(0, lon_length):
          if ice[:,i,j] >= x:
               #code to count number of occurrences here
               #code to calculate percentage here
               percent_ice[i,j] += count / len(time) #calculation 

 #then go on to plot percent_ice

我希望这是有道理的！我非常感谢任何帮助。我是用Python自学的，所以我可能会遗漏一些简单的东西。

这是使用any（）函数的时候吗？什么是最有效的方法来计算超过阈值的次数，然后计算百分比？

Answer 1

您可以将输入的3D数组与阈值x进行比较，然后沿着第一个轴与ndarray.sum(axis=0)求和以得到计数，从而获得百分比，如此 -

# Calculate count after thresholding with x and summing along first axis
count = (ice > x).sum(axis=0)

# Get percentages (ratios) by dividing with first axis length
percent_ice = np.true_divide(count,ice.shape[0])

Answer 2

啊，看，另一位气象学家！

可能有多种方法可以做到这一点，我的解决方案不太可能是最快的，因为它使用numpy的MaskedArray，这已知很慢，但这应该有效：

Numpy有一个名为MaskedArray的数据类型，它实际上包含两个普通的numpy数组。它包含数据数组和布尔掩码。我会首先屏蔽所有大于或等于我的阈值的数据（使用np.ma.masked_greater()只是大于）：

ice = np.ma.masked_greater_equal(ice)

然后，您可以使用ice.count()通过指定要沿特定轴计数来确定每个纬度/经度点的阈值低于阈值的数量：

n_good = ice.count(axis=0)

这应该返回一个包含好点数的二维数组。然后，您可以通过从n_good中减去ice.shape[0]来计算坏的数量：

n_bad = ice.shape[0] - n_good

并使用以下方法计算不良百分比：

perc_bad = n_bad/float(ice.shape[0])

如果不使用MaskedArray，有很多方法可以做到这一点。这只是我想到的简单方法。

计算Python中多维数组中达到或超过阈值的次数

2 个答案: