Question

我有一个numpy矩阵，我想获取每一行中最大值的索引。例如

[[1,2,3],[1,3,2],[3,2,1]]

将返回

[0,1,2]

但是，当每行中的最大值超过1个时，numpy.argmax将仅返回最小的索引。例如

[[0,0,0],[0,0,0],[0,0,0]]

将返回

[0,0,0]

我可以将默认值（最小索引）更改为其他一些值吗？例如。当最大值相等时，返回1或None，以便上述结果为

[1,1,1]
or
[None, None, None]

如果我可以在TensorFlow中做到这一点，那就更好了。

谢谢！

Answer 1

您可以使用np.partition中的两个找到两个最大值并检查它们是否相等，然后将其用作np.where中的掩码以设置默认值：

In [228]: a = np.array([[1, 2, 3, 2], [3, 1, 3, 2], [3, 5, 2, 1]])

In [229]: twomax = np.partition(a, -2)[:, -2:].T

In [230]: default = -1

In [231]: argmax = np.where(twomax[0] != twomax[1], np.argmax(a, -1), default)

In [232]: argmax
Out[232]: array([ 2, -1,  1])

Answer 2

“ default”的方便值为-1，因为argmax不会自行返回该值。 None不适合整数数组。也可以选择masked array，但我没有走那么远。这是NumPy的实现

def my_argmax(a):
    rows = np.where(a == a.max(axis=1)[:, None])[0]
    rows_multiple_max = rows[:-1][rows[:-1] == rows[1:]]
    my_argmax = a.argmax(axis=1)
    my_argmax[rows_multiple_max] = -1
    return my_argmax

使用示例：

import numpy as np
a = np.array([[0, 0, 0], [4, 5, 3], [3, 4, 4], [6, 2, 1]])
my_argmax(a)   #  array([-1,  1, -1,  0])

说明：where选择每行中 all 个最大元素的索引。如果一行具有多个最大值，则行号将在rows数组中出现多次。由于此数组已经排序，因此可以通过比较连续元素来检测这种重复。这将标识具有多个最大值的行，然后在NumPy的argmax方法的输出中将其屏蔽。

数值相等时的numpy argmax

2 个答案: