Question

我知道MAPE和WMAPE作为预测误差指标，它们有一些好处。但差距是什么？有人说：

For MAPE:
"Combinations with very small or zero volumes can cause large skew in results"

And for WMAPE:
"Combinations with large weights can skew the results in their favor"

我无法理解，任何人都能解释这两个指标弱点的两个陈述吗？谢谢。

Answer 1

对于MAPE，平均绝对百分比误差[1]，假设我们用 A 表示实际值，用 P 表示预测值。您在1到n时间有一系列数据，然后

MAPE = 100/n * ( Sum of |(A(t) - P(t))/A(t)| ), for t in 1..n
where A(t) is the actual value at time t, P(t) is the predicted value at time t.

由于A（t）在分母中，每当你有一个非常小或接近零的A（t）时，该除法就像是除以零，这会在绝对百分比误差中产生非常大的变化。如此大的变化的组合肯定会导致结果的大幅度偏差。

对于WMAPE，加权平均绝对百分比误差，

         Sum of |(A(t) - P(t))/A(t)| * W(t)
WMPAE = -------------------------------------, for t in 1..n
                    Sum of W(t)

        where W(t) is the weight you associate with the prediction at time t.

由于这是一种加权测量，因此它没有与MAPE相同的问题，例如由于体积非常小或零而导致的过度倾斜。

然而，加权因子表明我们希望对每个预测进行主观重要性[2]。

例如，考虑发布日期，我们可以以更高的方式分配权重重量，我们对更近期的重要性数据。在这种情况下，我们甚至可以观察到MAE 是在一个合理的门槛下，一个系统的表现在分析这个特定功能时可能不够。

这是对最近数据的偏好如何扭曲结果。

[1] http://en.wikipedia.org/wiki/Mean_absolute_percentage_error
[2] http://ir.ii.uam.es/rue2012/papers/rue2012-cleger-tamayo.pdf

Answer 2

还有另一个错误指标：

WAPE = 100/n * Sum(|(A(t) - P(t)|)/sum(A(t)), for t in 1..n
where A(t) is the actual value at time t, P(t) is the predicted value at time t.

对大变形不敏感。

预测误差指标的差距是什么：MAPE和WMAPE？

2 个答案: