Question

在下面的代码中，y1和y2应该相等，但它们不是。 vectorize（）或dot（）中可能有错误吗？

import numpy as np
interval = np.arange(0, 30, 0.1)
y1 = [- 1.57 * max(0, x - 10) - 0.72 * max(0, 15 - x)
      - 1.09 * max(0, 20 - x) for x in interval]

def fun(x, pivot, truth):
    if truth: return max(0, x - pivot)
    else:     return max(0, pivot - x)

pivots = [10, 15, 20]
truths = [ 1,  0,  0]
coeffs = [-1.57, -0.72, -1.09]
y2 = [np.dot(np.vectorize(fun)(x, pivots, truths), coeffs) for x in interval]

import matplotlib.pyplot as plt
plt.plot(interval, y1, interval, y2)
plt.show()

y1和y2的图表：

Answer 1

我不确定这适用于您的情况，但vectorize有一些技巧。

如果您没有指定返回dtype，则会根据您的第一个案例通过测试计算来确定它。如果函数返回标量整数，如0，那么vectorize将返回一个整数数组。因此，如果您期望浮动，请确保指定返回dtype。

此外 - vectorize不是速度工具。它只是将广播应用于输入的便捷方式。它并不比显式循环输入快得多。

np.vectorize(fun, otypes=[float])

删除步骤。

===========

试试这个：

vfun = np.vectorize(fun, otypes=[float])
X = vfun(interval[:,None], pivots, truths)
print(X.shape)     # (300,3)
y2 = np.dot(X, coeffs)
print(y2.shape)    # (300,)

它更充分地利用vectorize's广播。

我怀疑您的fun可以被编写，以便在整个x上行动，而不会vectorize进行迭代。

更改fun以使用np.maximum，允许我提供数组x：

def fun(x, pivot, truth):
    if truth: return np.maximum(0, x - pivot)
    else:     return np.maximum(0, pivot - x)

然后，我可以只计算X和pivots的3个案例中的truths，一次计算所有interval值：

X = np.stack([fun(interval, p, t) for p, t in zip(pivots, truths)], axis=-1)
y2 = np.dot(X, coeffs)

应用3＆＃39;案例的另一种方式＆＃39;

Xlist = [fun(interval, p, t)*c for p, t, c in zip(pivots, truths, coeffs)]
y2 = np.sum(Xlist, axis=0)

由于np.dot(..., coeffs)只是一个加权和。我不确定它会更好。

Answer 2

为了应用适当的强制转换规则，numpy会偶尔使用你的函数和sentinel值（numpy.int64）来检查它输出的数据类型，如果它输出整数0，那就是max返回然后它假定计算的结果应该都是整数并舍弃其他结果，但是如果你改变函数总是在max：{/ 1>中使用0.0返回浮点数。

def fun(x, pivot, truth):
    if truth: return max(0.0, x - pivot)
    else:     return max(0.0, pivot - x)

然后numpy应用的检查将始终导致浮动结果，并且不会应用舍入。

NumPy vectorize（）或dot（）出现错误

2 个答案: