无法在slurm,ubuntu16.04服务器中的matlab中创建parpool

时间:2017-04-02 05:08:59

标签: matlab slurm

我在slurm

中使用matlab创建parpool时遇到了一些麻烦

当我提交作业时,它会被卡住:

parpool size:24

Starting parallel pool (parpool) using the 'local' profile ...

or error

{Error using parpool (line 104)
Failed to start a parallel pool. (For information in addition to the causing
error, validate the profile 'local' in the Cluster Profile Manager.)


Error in run (line 86)
evalin('caller', [script ';']);

Caused by:
    Error using parallel.internal.pool.InteractiveClient>iThrowWithCause (line
    666)
    Failed to initialize the interactive session.
        Error using
        parallel.internal.pool.InteractiveClient>iThrowIfBadParallelJobStatus
        (line 767)
        The interactive communicating job failed with no message.
} 

还有一个matlab故障转储文件

------------------------------------------------------------------------
       Segmentation violation detected at Sun Apr  2 11:36:33 2017
------------------------------------------------------------------------

Configuration:
  Crash Decoding      : Disabled - No sandbox or build area path
  Crash Mode          : continue (default)
  Default Encoding    : UTF-8
  GNU C Library       : 2.23 stable
  Host Name           : wmc-slave-g2
  MATLAB Architecture : glnxa64
  MATLAB Root         : /opt/matlab/R2017a
  MATLAB Version      : 9.2.0.538062 (R2017a)
  Operating System    : Linux 4.4.0-66-generic #87-Ubuntu SMP Fri Mar 3 15:29:05 UTC 2017 x86_64
  Processor ID        : x86 Family 6 Model 79 Stepping 1, GenuineIntel

Fault Count: 1


Abnormal termination:
Segmentation violation

Register State (from fault):
  RAX = 00007f7410256900  RBX = 0000000000000000
  RCX = 0000000000000000  RDX = 00007f7410256978
  RSP = 00007f741e240868  RBP = 00007f741e240870
  RSI = 0000000000000000  RDI = 00007f741e240870

   R8 = 0000000000000000   R9 = 0000000000000000
  R10 = 00000000000000ed  R11 = 00007f743afade60
  R12 = 00007f7410256978  R13 = 00007f741e2408a0
  R14 = 00007f741e2409f0  R15 = 00007f7410258110

  RIP = 00007f743afade60  EFL = 0000000000010202

   CS = 0033   FS = 0000   GS = 0000

Stack Trace (from fault):
[  0] 0x00007f743afade60 /opt/matlab/R2017a/bin/glnxa64/libboost_thread.so.1.56.0+00069216 _ZNK5boost6thread15get_thread_infoEv+00000000
[  1] 0x0000000000000000                                   <unknown-module>+00000000

如果此问题可重现,请通过以下方式提交服务请求:

技术支持工程师可能会就更多信息与您联系。

由于

1 个答案:

答案 0 :(得分:0)

这通常是由于Linux上的Matlab使用的java版本不兼容。问题:

echo $MATLAB_JAVA

如果设置了此变量,则可以取消设置此选项并让Matlab使用自己的java。你可以运行这样的脚本:

#!/bin/sh unset MATLAB_JAVA matlab -desktop