如何使用mxnet中的lstm-ctc检查点进行预测?

时间:2016-12-10 02:00:37

标签: lstm mxnet

我按照示例/ warpctc / lstm_ocr.py来训练模型。现在我保存了一个检查点mymodel-0100.params和mymodel-symbol.json。
那么,如何使用此检查点进行预测只使用一个图像?

我已经厌倦了使用Predictor界面,代码如下:

# Load the pre-trained model
symbol_file = "mymodel-symbol.json"
param_file = "mymodel-0100.params"
predictor = Predictor(open(symbol_file).read(),
    open(param_file).read(),
    {'data':(80, 30)})

但数据形状总是会引发错误,我不知道如何设置此值。有人帮我谢谢。

然而,我也尝试了另一种方式: 在mxnet / example / warpctc / lstm_ocr.py的末尾添加一行代码:

model = mx.model.FeedForward(ctx=contexts,
                             symbol=symbol,
                             num_epoch=num_epoch,
                             learning_rate=learning_rate,
                             momentum=momentum,
                             wd=0.00001,
                             initializer=mx.init.Xavier(factor_type="in", magnitude=2.34))

model.fit(X=data_train, eval_data=data_val,
          eval_metric = mx.metric.np(Accuracy),
          batch_end_callback=mx.callback.Speedometer(BATCH_SIZE, 50),)

model.save("ocr")

# add new line for predict
model.predict(data_val)

但它始终是错误输出:

Traceback (most recent call last):
File "lstm_ctc_ocr.py", line 211, in <module>
training_all()
File "lstm_ctc_ocr.py", line 188, in training_all
model.predict(data_val)
File "/home/bobliu/Work/code/DL/mxnet/python/mxnet/model.py", line 618, in predict
self._init_predictor(data_shapes, type_dict)
File "/home/bobliu/Work/code/DL/mxnet/python/mxnet/model.py", line 541, in _init_predictor
self.ctx[0], grad_req='null', type_dict=type_dict, **dict(input_shapes))
File "/home/bobliu/Work/code/DL/mxnet/python/mxnet/symbol.py", line 685, in simple_bind
arg_types, _, aux_types = self.infer_type(**type_dict)
File "/home/bobliu/Work/code/DL/mxnet/python/mxnet/symbol.py", line 417, in infer_type
ctypes.byref(complete)))
File "/home/bobliu/Work/code/DL/mxnet/python/mxnet/base.py", line 77, in check_call
raise MXNetError(py_str(_LIB.MXGetLastError()))
mxnet.base.MXNetError: InferType Error in reshape0: [21:39:03] src/operator/./reshape-inl.h:345: Check failed: (dtype) != (-1) First input must have specified type

1 个答案:

答案 0 :(得分:0)

阅读完源代码后。我知道我现在有什么问题。

1。重新发送到predictor()函数。

我们必须使用完整符号设置数据形状 在这个例子中:

    init_c = [('l%d_init_c'%l, (batch_size, num_hidden)) for l in range(num_lstm_layer)]
    init_h = [('l%d_init_h'%l, (batch_size, num_hidden)) for l in range(num_lstm_layer)]
    init_states = init_c + init_h

    all_shapes = [('data', (batch_size, 100 * 30))] + init_states + [('label', (batch_size, num_label))]

因此,您必须设置所有形状的数据。

2。重新升级到model.predict

我认为源代码中出现了一个错误,但我不确定我是不是错了 请检查model.py文件中的预测功能 更改以下代码(注意X.provide_label):

    data_shapes = X.provide_data + X.provide_label
    data_names = [x[0] for x in data_shapes]
    type_dict = dict((key, value.dtype) for (key, value) in self.arg_params.items())
    for x in X.provide_data + X.provide_label:
        if isinstance(x, DataDesc):
            type_dict[x.name] = x.dtype
        else:
            type_dict[x[0]] = mx_real_t

并且这一点:

        for o_list, o_nd in zip(output_list, self._pred_exec.outputs[0:real_size]):
            o_list.append(o_nd.asnumpy())

现在,一切都与我合作。