如何撤消平方根变换变量的差分(滞后阶数1)? [ARIMA()]

时间:2017-05-31 19:15:41

标签: time-series transform difference

我正在使用statsmodels.tsa.arima_model.ARIMA,在将其插入算法之前,我采用了内生变量的平方根变换。该模型使用差分顺序1:

model = ARIMA(sj_sqrt, order=(2, 1, 0))

在拟合模型并抓住预测后,我想将预测放回原始形式,以便与原始数据进行比较。但是,我似乎无法正确地改变它们。

要复制此问题的简单版本,请参阅以下代码:

#original data:
test = pd.Series([1,1,1,50,1,1,1,1,1,1,1,1,40,1,1,2,1,1,1,1,1]) 

#sqrt transformed data:
test_sqrt = np.sqrt(test)

#sqrt differenced data:
test_sqrt_diff = test_sqrt.diff(periods=1)

#undo differencing:
test_sqrt_2 = cumsum(test_sqrt_diff)

#undo transformations:
test_2 = test_sqrt_2 ** 2

f, axarr = plt.subplots(5, sharex=True, sharey=True)
axarr[0].set_title('original data:')
axarr[0].plot(test)
axarr[1].set_title('sqrt transformed data:')
axarr[1].plot(test_sqrt)
axarr[2].set_title('sqrt differenced data:')
axarr[2].plot(test_sqrt_diff)
axarr[3].set_title('differencing undone with .cumsum():')
axarr[3].plot(test_sqrt_2)
axarr[4].set_title('transformation undone by squaring:')
axarr[4].plot(test_2)
f.set_size_inches(5, 12)

从图表中可以看出,未经过差异的,未经过转换的数据并不完全相同。 test[3]返回50test_2[3]返回36.857864376269056 enter image description here

1 个答案:

答案 0 :(得分:0)

解决方案:

## original
x = np.array([1,1,1,50,1,1,1,1,1,1,1,1,40,1,1,2,1,1,1,1,1])

## sqrt
x_sq = np.sqrt(x)

## diff
d_sq = np.diff(x_sq,n=1)

## Only works when d = 1
def diffinv(d,i):
    inv = np.insert(d,0,i)
    inv = np.cumsum(inv)
    return inv

## inv diff 
y_sq = diffinv(d_sq,x_sq[0])

## Check inv diff
(y_sq==x_sq).all()