Question

我在页面下面用anaconda构建了caffe2。

在具有单个titanx的服务器中，具有cudnn7和cuda9但没有nccl，因此我从nvidia下载了nccl2并将其提取到path / to / local / nccl2，然后编辑./pytorch/conda/第42行的“ integrated / build.sh”为：“ export NCCL_ROOT_DIR = path / to / local / nccl2”。

然后我需要将caffe2与python2一起使用，因此我在./pytorch/scripts/build_anaconda.sh中添加了“ conda_args + =（” --python 2.7“）”以使用python2.7。

构建成功，但是当我从caffe2.python import core运行python2 test.py

它告诉我：

警告：root：此caffe2 python运行不支持GPU。将在仅CPU模式下运行。

警告：root：调试消息：没有名为caffe2_pybind11_state_hip的模块

分段错误（核心已转储）

我的问题是：

a。为什么conda不支持GPU？

b。如果我使用的是单个GPU，则是否需要使用nccl进行构建？

c。如何修复没有名为caffe2_pybind11_state_hip的模块

PyTorch或Caffe2：caffe2

如何安装PyTorch（conda，pip，源代码）：conda

您使用的构建命令（如果从源代码编译）：./ scripts / build_anaconda.sh --install-locally --cuda 9.0 --cudnn 7

OS：ubuntu16

PyTorch版本：

Python版本：2.7

CUDA / cuDNN版本：9.1 / 7

GPU模型和配置：??

GCC版本（如果从源代码编译）：5.4.0

CMake版本：未安装

任何其他相关库的版本：

非常感谢您！

Answer 1

首先获取CUDA并安装它：

sudo apt-get update && sudo apt-get install wget -y --no-install-recommends
wget "http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1604/x86_64/cuda-repo-ubuntu1604_8.0.61-1_amd64.deb"
sudo dpkg -i cuda-repo-ubuntu1604_8.0.61-1_amd64.deb
sudo apt-get update
sudo apt-get install cuda

现在从源代码继续安装（在环境中进行安装）：

FULL_CAFFE2 = 1 python setup.py install

您可以在此处找到更多信息：https://caffe2.ai/docs/getting-started.html?platform=ubuntu&configuration=compile#install-with-gpu-support

Answer 2

按照下面对我有用的程序

ubuntu@test:~$ cd $HOME

ubuntu@test:~$ conda create -n caffe2

ubuntu@test:~$ source activate caffe2

(caffe2) ubuntu@test:~$ git clone --recursive https://github.com/pytorch/pytorch.git && cd pytorch


(caffe2) ubuntu@test:~/pytorch$ git submodule update --init


(caffe2) ubuntu@test:~/pytorch$ CONDA_INSTALL_LOCALLY=1 ./scripts/build_anaconda.sh --cuda 8.0 --cudnn 7 -DUSE_CUDA=ON -DUSE_NCCL=ON

用conda构建caffe2失败

2 个答案: