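I'm new to DIGITS and the TX2. I'm trying to create an object detection model by following the tutorial at https://github.com/dusty-nv/jetson-inference.
I created the dataset successfully; the problem is with the model.
When I create the model, I get the following error: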
Memory required for data: 3268934784
creating layer bbox_loss
Creating Layer bbox_loss
bbox_loss <- bboxes-obj-masked-norm
bbox_loss <- bbox-obj-label-norm
bbox_loss -> loss_bbox
Setting up bbox_loss
Top shape: (1)
with loss weight 2
Memory required for data: 3268934788
Creating layer coverage_loss
Creating Layer coverage_loss
coverage_loss <- coverage_coverage/sig_0_split_0
coverage_loss <- coverage-label_slice-label_4_split_0
coverage_loss -> loss_coverage
Setting up coverage_loss
Top shape: (1)
with loss weight 1
Memory required for data: 3268934792
Creating layer cluster
The job directory information shown on the left is:
Job Directory
/home/nvidia/DIGITS/digits/jobs/20180816-161051-e67a
Disk Size
0 B
Network (train/val)
train_val.prototxt
Network (deploy)
deploy.prototxt
Network (original)
original.prototxt
Solver
solver.prototxt
Raw caffe output
caffe_output.log
Pretrained Model
/home/nvidia/bvlc_googlenet.caffemodel.4
Visualizations
Tensorboard
The error on the server is:
2018-08-16 16:10:53 [20180816-161051-e67a] [INFO ] Task subprocess args: "/home/nvidia/Caffe/caffe/build/tools/caffe train --solver=/home/nvidia/DIGITS/digits/jobs/20180816-161051-e67a/solver.prototxt --gpu=0 --weights=/home/nvidia/bvlc_googlenet.caffemodel.4"
2018-08-16 16:11:00 [20180816-161051-e67a] [ERROR] Train Caffe Model task failed with error code 1
I don't know how to free up memory, since there is more than 2 GB of free space in the job directory. Please help. Thanks in advance.
Answer 0 (score: 0)
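I ran into the same problem over the last few days; maybe this will help someone in the future. First, make sure you have the correct protobuf version. You can check it with: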
protoc --version
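If it is 2.*, you have to update to 3.*, for example by building it according to the instructions at https://github.com/NVIDIA/DIGITS/blob/digits-6.0/docs/BuildProtobuf.md, and then rebuild Caffe. Also make sure you have a compatible version of the protobuf pip package. For me, the following worked well with the current versions of DIGITS and Caffe from the https://github.com/dusty-nv/jetson-inference tutorial: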
pip install --user --upgrade protobuf==3.1.0.post1
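If it helps, here is a rough sketch of a script that does both checks in one go: it reads the compiler version from protoc --version and the Python package version from google.protobuf.__version__, then compares their major versions. The helper names (parse_major, protoc_major, python_protobuf_major) are made up for this example, and it assumes protoc is on your PATH and that the protobuf package is importable from the same Python that DIGITS uses. It only reports the versions; it does not prove that this mismatch is the cause of your particular failure.

```python
import re
import subprocess


def parse_major(version_text):
    """Return the major version found in text such as 'libprotoc 3.1.0'."""
    match = re.search(r"(\d+)\.\d+", version_text)
    if match is None:
        raise ValueError("no version number found in %r" % version_text)
    return int(match.group(1))


def protoc_major():
    """Major version reported by the protoc binary on the PATH (assumed installed)."""
    output = subprocess.check_output(["protoc", "--version"])
    return parse_major(output.decode("utf-8", "replace"))


def python_protobuf_major():
    """Major version of the protobuf package visible to this Python interpreter."""
    import google.protobuf
    return parse_major(google.protobuf.__version__)


if __name__ == "__main__":
    try:
        compiler = protoc_major()
        package = python_protobuf_major()
    except (OSError, ImportError, ValueError, subprocess.CalledProcessError) as exc:
        # protoc missing, package not installed, or unexpected version output
        print("could not query protobuf: %s" % exc)
    else:
        print("protoc major version: %d" % compiler)
        print("Python protobuf major version: %d" % package)
        if compiler < 3 or package < 3:
            print("At least one of them is older than 3.x; consider upgrading.")
        elif compiler != package:
            print("Major versions differ; consider aligning them.")
```

Run it with the same interpreter that starts DIGITS, so that the pip package it reports is the one DIGITS actually imports.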