Open MPI with CUDA installation issue

Asked: 2019-04-29 15:11:17

Tags: openmpi

While trying to install Open MPI with CUDA support, I ran into the following errors during make.

btl_uct_module.c: In function ‘mca_btl_uct_reg_mem’:
btl_uct_module.c:214:22: error: ‘UCT_MD_MEM_ACCESS_REMOTE_GET’ undeclared (first use in this function)
         uct_flags |= UCT_MD_MEM_ACCESS_REMOTE_GET;
                      ^
btl_uct_module.c:214:22: note: each undeclared identifier is reported only once for each function it appears in
btl_uct_module.c:217:22: error: ‘UCT_MD_MEM_ACCESS_REMOTE_PUT’ undeclared (first use in this function)
         uct_flags |= UCT_MD_MEM_ACCESS_REMOTE_PUT;
                      ^
btl_uct_module.c:220:22: error: ‘UCT_MD_MEM_ACCESS_REMOTE_ATOMIC’ undeclared (first use in this function)
         uct_flags |= UCT_MD_MEM_ACCESS_REMOTE_ATOMIC;
                      ^
btl_uct_module.c:225:21: error: ‘UCT_MD_MEM_ACCESS_ALL’ undeclared (first use in this function)
         uct_flags = UCT_MD_MEM_ACCESS_ALL;
                     ^
Makefile:1912: recipe for target 'btl_uct_module.lo' failed
make[2]: *** [btl_uct_module.lo] Error 1
make[2]: Leaving directory '/home/usama/install/openmpi-4.0.1/opal/mca/btl/uct'
Makefile:2375: recipe for target 'all-recursive' failed
make[1]: *** [all-recursive] Error 1
make[1]: Leaving directory '/home/usama/install/openmpi-4.0.1/opal'
Makefile:1893: recipe for target 'all-recursive' failed
make: *** [all-recursive] Error 1

I used the following commands to configure and then install.

./configure --prefix=/home/$USER/.openmpi --with-cuda
make all install

I am using the following configuration:

Ubuntu 16.04
CUDA 10.1
cuDNN 7.5
Open MPI 4.0.1

Strangely, the same installation works fine on my local machine running Ubuntu 18.04. Is there a compatibility issue? Any ideas?

1 Answer:

Answer 0 (score: 0)

It turned out to be a compatibility issue after all. Using Open MPI 3.1.4 solved the problem.
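For anyone repeating this, here is a rough sketch of the downgrade, reusing the configure flags from the question. The download URL pattern, the `run` helper, the `build_openmpi` function, and the `DRY_RUN` switch are my own assumptions for illustration, not something from the original post. By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch only: version, prefix, and download URL are assumptions based on the
# thread; adjust them for your system before running for real.
OMPI_VERSION="${OMPI_VERSION:-3.1.4}"   # version reported to work in the answer
PREFIX="${PREFIX:-$HOME/.openmpi}"      # same prefix style as in the question
DRY_RUN="${DRY_RUN:-1}"                 # 1 = only print the commands

# Hypothetical helper: print a command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Hypothetical helper: download, configure, and install the chosen release.
build_openmpi() {
    series="v${OMPI_VERSION%.*}"        # e.g. 3.1.4 -> v3.1
    tarball="openmpi-${OMPI_VERSION}.tar.gz"
    run wget "https://download.open-mpi.org/release/open-mpi/${series}/${tarball}"
    run tar xzf "$tarball"
    run cd "openmpi-${OMPI_VERSION}"
    run ./configure --prefix="$PREFIX" --with-cuda
    run make all install
}

build_openmpi
```

Set `DRY_RUN=0` to actually run the build. Afterwards, `ompi_info --parsable --all | grep mpi_built_with_cuda_support:value` should report whether CUDA support was compiled in. The failing file belongs to the UCT BTL component, which is built against the system's UCX headers, so if you need to stay on 4.0.1 it may also be worth trying `--enable-mca-no-build=btl-uct` at configure time. I have not verified that on this setup.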