我有10年的每日需求数据,呈正趋势。 https://gofile.io/?c=PS3YCO
在一年的最后三个月中,总会有1-2的需求冲击。和15-16。月份(促销)。
我尝试通过以下方式进行预测:
future = m.make_future_dataframe(periods = 365)
预测= m.predict(未来)
但是结果不是我期望的。我可以获得的最佳MSE是6681。但是当我尝试交叉验证时,结果几乎与6690相同。
也:当我使用“从fbprophet.diagnostics导入performance_metrics”来计算MSE时,它并没有为我提供测试数据的值,但是会持续更长的时间。我如何才能计算出去年的MSE?
非常感谢您的帮助:)
import numpy as np
from fbprophet import Prophet
import matplotlib.pyplot as plt
from fbprophet.diagnostics import cross_validation
df = pd.read_excel('Dataset2.3_kurz.xls')
promotions = pd.DataFrame({
'holiday': 'winter_promotion',
'ds': pd.to_datetime(['2009-10-1','2009-10-2','2009-10-15','2009-10-16',
'2009-11-1','2009-11-2','2009-11-15','2009-11-16',
'2009-12-1','2009-12-2','2009-12-15','2009-12-16',
'2010-10-1','2010-10-2','2010-10-15','2010-10-16',
'2010-11-1','2010-11-2','2010-11-15','2010-11-16',
'2010-12-1','2010-12-2','2010-12-15','2010-12-16',
'2011-10-1','2011-10-2','2011-10-15','2011-10-16',
'2011-11-1','2011-11-2','2011-11-15','2011-11-16',
'2011-12-1','2011-12-2','2011-12-15','2011-12-16',
'2012-10-1','2012-10-2','2012-10-15','2012-10-16',
'2012-11-1','2012-11-2','2012-11-15','2012-11-16',
'2012-12-1','2012-12-2','2012-12-15','2012-12-16',
'2013-10-1','2013-10-2','2013-10-15','2013-10-16',
'2013-11-1','2013-11-2','2013-11-15','2013-11-16',
'2013-12-1','2013-12-2','2013-12-15','2013-12-16',
'2014-10-1','2014-10-2','2014-10-15','2014-10-16',
'2014-11-1','2014-11-2','2014-11-15','2014-11-16',
'2014-12-1','2014-12-2','2014-12-15','2014-12-16',
'2015-10-1','2015-10-2','2015-10-15','2015-10-16',
'2015-11-1','2015-11-2','2015-11-15','2015-11-16',
'2015-12-1','2015-12-2','2015-12-15','2015-12-16',
'2016-10-1','2016-10-2','2016-10-15','2016-10-16',
'2016-11-1','2016-11-2','2016-11-15','2016-11-16',
'2016-12-1','2016-12-2','2016-12-15','2016-12-16',
'2017-10-1','2017-10-2','2017-10-15','2017-10-16',
'2017-11-1','2017-11-2','2017-11-15','2017-11-16',
'2017-12-1','2017-12-2','2017-12-15','2017-12-16',
'2018-10-1','2018-10-2','2018-10-15','2018-10-16',
'2018-11-1','2018-11-2','2018-11-15','2018-11-16',
'2018-12-1','2018-12-2','2018-12-15','2018-12-16',
'2019-10-1','2019-10-2','2019-10-15','2019-10-16',
'2019-11-1','2019-11-2','2019-11-15','2019-11-16',
'2019-12-1','2019-12-2','2019-12-15','2019-12-16']),
'lower_window': 0,
'upper_window': 0,
})
#model
m = Prophet( growth='linear',
holidays=promotions,
seasonality_mode='multiplicative',
holidays_prior_scale=10,
seasonality_prior_scale=10,
yearly_seasonality=True,
)
m.fit(df)
df_cv = cross_validation(m, initial='732 days', period='365 days', horizon = '366 days')
from fbprophet.diagnostics import performance_metrics
df_p = performance_metrics(df_cv)
df_p = df_p[-365:]
df_p.tail()
答案 0 :(得分:1)
尝试使用网格搜索并调整超参数。使用 changepoint_prior_scale 参数可能也有帮助。这只是个人观点,但将年度季节性设置为False并以新的Fourier顺序和优先级在外部添加它也可能会有所帮助。关于cross_validation,请检查this link以获得更好的理解。