打开MPI:4.0.1a
HostFile:
34bb0519eAAA
a2935f150BBB
我在机器34bb0519eAAA
中。而且我可以使用ssh a2935f150BBB
成功连接a2935f150BBB
。还要在机器34bb0519eAAA
中SSH a2935f150BBB
来成功连接34bb0519eAAA
。
但是当我mpiexec命令时。我收到错误消息
****Warning: Permanently added '[XX.XX.XX.XX]:XX' (a2935f150BBB'IP address) to the list of known hosts.**
----------------------**--------------------------------------
A process or daemon was unable to complete a TCP connection
to another process:
Local host: a2935f150BBB
Remote host: 34bb0519eAAA
This is usually caused by a firewall on the remote host. Please
check that any firewall (e.g., iptables) has been disabled and
ORTE was unable to reliably start one or more daemons.
This usually is caused by:
* not finding the required libraries and/or binaries on
one or more nodes. Please check your PATH and LD_LIBRARY_PATH
settings, or configure OMPI with --enable-orterun-prefix-by-default
* lack of authority to execute on one or more specified nodes.
Please verify your allocation and authorities.
* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
Please check with your sys admin to determine the correct location to use.
* compilation of the orted with dynamic libraries when static are required
(e.g., on Cray). Please check your configure cmd line and consider using
one of the contrib/platform definitions for your system type.
* an inability to create a connection back to mpirun due to a
lack of common network interfaces and/or no route found between
them. Please check network connectivity (including firewalls
and network routing requirements).
我对此感到非常困惑。因为我彼此成功运行了ssh。怎么会失败。
这是ssh连接
ssh a2935f150BBB
警告:已将“ [[XX.XX.XX.XX]:XX]”永久添加到已知主机列表中。
欢迎使用Ubuntu 18.04.1 LTS(XXXXXXXXXXXXXXXXXX)
已通过删除以下软件包和内容使此系统最小化: 在用户未登录的系统上不需要。
要恢复此内容,可以运行“ unminimize”命令。 上次登录:XXXXXXXXXX的XXXXXXXXXXXXX