我正在训练一个具有恒定0准确度的网络,我知道网络没有学习。我尝试了不同的批量大小和学习率,但没有帮助。鉴于下面显示的网络原型和解算器,可能出现什么问题?谢谢!
layer {
name: "data"
type: "HDF5Data"
top: "X"
top: "y"
hdf5_data_param{
source:"/A/B/trainlist.txt"
batch_size: 1
}
include{phase: TRAIN}
}
layer {
name: "data"
type: "HDF5Data"
top: "X"
top: "y"
hdf5_data_param{
source:"/A/B/testlist.txt"
batch_size: 1
}
include{phase: TEST}
}
这是solver.prototxt
net: "/A/B/train.prototxt"
test_iter: 10
test_interval: 1000
base_lr: 0.01
lr_policy: "step"
gamma: 0.1
stepsize: 1000
display: 10
max_iter: 4000
momentum: 0.9
weight_decay: 0.0005
snapshot: 1000
snapshot_prefix: "/A/B/model_"
solver_mode: GPU