Question

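I am using Python for deep learning. My problem is a time series forecasting problem: I want to predict the evolution of the sunspot number. All sunspot values since 1749 are available here: http://www.sidc.be/silso/DATA/SN_ms_tot_V2.0.txt.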

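I want to use a sliding window of 43 months, so my dataset now consists of 44 columns and 3170 rows (the value I want to predict is the 44th column, based on the previous 43 months).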

My data looks like this:

135.90,137.90,140.20,143.80,146.40, ... 68.10,63.60,60.40

137.90,140.20,143.80,146.40,147.90, ... 63.60,60.40,61.10

140.20,143.80,146.40,147.90,148.40, ... 60.40,61.10,59.70

...

99.0,104.6,107.0,106.9,107.6, ... 27.80,26.50,25.70
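For reference, rows of this shape can be built roughly as follows. This is only a minimal sketch: the helper name `make_windows` and the toy series are made up for illustration, and it assumes the monthly values have already been read into a plain Python list in chronological order.

```python
def make_windows(series, n_lags=43):
    """Return rows of n_lags inputs followed by the next value (the target)."""
    width = n_lags + 1  # 43 predictors + 1 target = 44 columns
    return [series[i:i + width] for i in range(len(series) - width + 1)]

# Toy series standing in for the monthly sunspot values (not real data).
toy = [float(v) for v in range(10)]
rows = make_windows(toy, n_lags=3)

print(len(rows))   # number of rows produced
print(rows[0])     # first window: 3 inputs, then the target
print(rows[-1])    # last window
```

Each row overlaps the previous one by all but one value, which is why consecutive rows in the sample above look shifted by one month.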

I have split the dataset into a training set (the first 80% of the rows) and a validation set (the last 20%). See the code below:

import h2o
from h2o.estimators.deeplearning import H2ODeepLearningEstimator

h2o.init()

test=h2o.import_file("validationSet_43month.txt")
train=h2o.import_file("trainingSet_43month.txt")
l=train.shape[1] 
x=train.names[0:l-1] 
y=train.names[l-1]

Factiv="Tanh"
HiddenLayer=[100,100]
Nepochs=2000

model=H2ODeepLearningEstimator(
    activation=Factiv,
    hidden=HiddenLayer,
    epochs=Nepochs,
    reproducible=True,
    stopping_rounds=0,  # disable early stopping so any eventual overfitting shows in the scoring history
    seed=123456789)
model.train(x=x,y=y,training_frame=train,validation_frame=test)
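The two files loaded above were produced by a chronological 80/20 split. A minimal sketch of such a split is shown here, assuming the windowed rows are held in a list in time order; `chronological_split` is a hypothetical helper, not part of H2O.

```python
def chronological_split(rows, train_fraction=0.8):
    """Split rows in time order: the first part for training, the rest for validation."""
    cut = int(len(rows) * train_fraction)
    return rows[:cut], rows[cut:]

# Placeholder rows standing in for the 3170 windowed samples (not real data).
all_rows = list(range(3170))
train_rows, valid_rows = chronological_split(all_rows)

print(len(train_rows), len(valid_rows))
```

The rows are not shuffled, so the validation set contains only the most recent windows.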

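I want to plot the scoring history to find the best number of epochs to use, but my scoring history is very noisy, with spikes (see the image).

Figure: Scoring history on 10,000 epochs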

Figure: Zoom on 2,000 epochs for validation deviance

I expected to get a scoring history like this one:

Figure: Normal scoring history
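To make "best number of epochs" concrete, this is roughly what I would like to read off the scoring history. It is only a sketch on made-up numbers: in H2O the table would come from `model.scoring_history()`, and the column names `epochs` and `validation_deviance` are assumed here rather than checked.

```python
def best_epoch(history, metric="validation_deviance"):
    """Return the (epochs, metric value) pair of the row with the lowest metric."""
    best = min(history, key=lambda row: row[metric])
    return best["epochs"], best[metric]

# Made-up scoring history, for illustration only (not real results).
toy_history = [
    {"epochs": 10.0, "validation_deviance": 9.0},
    {"epochs": 20.0, "validation_deviance": 4.0},
    {"epochs": 30.0, "validation_deviance": 6.5},
    {"epochs": 40.0, "validation_deviance": 5.0},
]

print(best_epoch(toy_history))
```

With a curve as noisy as mine, a single minimum like this may simply land on a random dip, which is why the spikes are a problem.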