MPI进程或守护程序无法完成TCP连接

时间:2019-01-24 01:43:34

标签: mpi openmpi

打开MPI:4.0.1a

HostFile:

  • 34bb0519eAAA
  • a2935f150BBB

我在机器34bb0519eAAA中。而且我可以使用ssh a2935f150BBB成功连接a2935f150BBB。还要在机器34bb0519eAAA中SSH a2935f150BBB来成功连接34bb0519eAAA

但是当我mpiexec命令时。我收到错误消息

****Warning: Permanently added '[XX.XX.XX.XX]:XX' (a2935f150BBB'IP address) to the list of known hosts.**
----------------------**--------------------------------------
A process or daemon was unable to complete a TCP connection
to another process:
  Local host:    a2935f150BBB
  Remote host:   34bb0519eAAA
This is usually caused by a firewall on the remote host. Please
check that any firewall (e.g., iptables) has been disabled and

ORTE was unable to reliably start one or more daemons.
This usually is caused by:

* not finding the required libraries and/or binaries on
  one or more nodes. Please check your PATH and LD_LIBRARY_PATH
  settings, or configure OMPI with --enable-orterun-prefix-by-default

* lack of authority to execute on one or more specified nodes.
  Please verify your allocation and authorities.

* the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base).
  Please check with your sys admin to determine the correct location to use.

*  compilation of the orted with dynamic libraries when static are required
  (e.g., on Cray). Please check your configure cmd line and consider using
  one of the contrib/platform definitions for your system type.

* an inability to create a connection back to mpirun due to a
  lack of common network interfaces and/or no route found between
  them. Please check network connectivity (including firewalls
  and network routing requirements).

我对此感到非常困惑。因为我彼此成功运行了ssh。怎么会失败。

这是ssh连接 ssh a2935f150BBB
警告:已将“ [[XX.XX.XX.XX]:XX]”永久添加到已知主机列表中。 欢迎使用Ubuntu 18.04.1 LTS(XXXXXXXXXXXXXXXXXX)

已通过删除以下软件包和内容使此系统最小化: 在用户未登录的系统上不需要。

要恢复此内容,可以运行“ unminimize”命令。 上次登录:XXXXXXXXXX的XXXXXXXXXXXXX

0 个答案:

没有答案