My problem is that when I spawn 2 Rslaves, my OpenMP processes appear to be pinned to a single core. When I spawn 3 slaves in the SAME job (when my job configuration allows it), I can spread across the allocated cores.
Here is what I am doing:
Start an interactive job on the cluster (batch gives similar results), e.g. with 3 nodes, 3 tasks per node (so that I can experiment) and 4 cores per task:
salloc -A arctest --gres=gpu:1 --partition=v100_dev_q -N 3 --tasks-per-node=2 --cpus-per-task=4 --time=2:00:00 srun -n1 -N1 --pty --preserve-env --cpu-bind=no --mpi=pmi2 --distribution=cyclic:cyclic $SHELL
Load some software:
module load intel/18.2 R/3.6.1 openmpi/4.0.1 R-parallel/3.6.1 cuda/10.1.168
We are on SLURM 17.11 (I think) with cgroups enabled, UCX 1.3.0, version 4.2.1.2.
OpenMPI was compiled with:
--with-cma \
--enable-dlopen \
--enable-shared \
--with-mxm=/opt/mellanox/mxm \
--with-pmi=/usr \
--with-slurm
All of this is on Mellanox ConnectX-5 cards.
cgroup.conf
CgroupAutomount=yes
ConstrainCores=yes
ConstrainDevices=yes
ConstrainRAMSpace=yes
ConstrainSwapSpace=yes
AllowedSwapSpace=4
Run a quick OpenMP test on the master node to check that I am not confined to a single core:
./openmp_example
openmp_example.c
#define _GNU_SOURCE   /* needed so <sched.h> declares sched_getcpu() and sched_getaffinity() */
#include <stdio.h>
#include <omp.h>
#include <sched.h>
#include <unistd.h>
int main() {
#pragma omp parallel num_threads(10)
  {
    char hostbuffer[256];
    gethostname(hostbuffer, sizeof(hostbuffer));
    /* sched_getaffinity() needs a pid, a mask size and a mask;
       it returns 0 on success and -1 on error */
    cpu_set_t mask;
    int schedaff = sched_getaffinity(0, sizeof(mask), &mask);
    int coreid = sched_getcpu();   /* core this thread is currently running on */
    int id = omp_get_thread_num();
    int total = omp_get_num_threads();
    int maxthread = omp_get_max_threads();
    printf("Host: %s : core: %d , I am running process %d out of %d (max %d ) with affinity %d \n",
           hostbuffer, coreid, id, total, maxthread, schedaff);
  }
  printf("parallel for ends.\n");
  return 0;
}
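(Compiled with nothing special, e.g. with the Intel compiler from the loaded module; gcc -fopenmp works just as well:)
icc -qopenmp -o openmp_example openmp_example.c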
I get output similar to the following:
Host: ca223 : core: 0 , I am running process 8 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 8 , I am running process 7 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 2 , I am running process 9 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 6 , I am running process 3 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 8 , I am running process 2 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 10 , I am running process 1 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 4 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 12 , I am running process 6 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 14 , I am running process 5 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 18 , I am running process 0 out of 10 (max 12 ) with affinity -1
That looks fine, so let's add some R / Rmpi after making a hostfile:
hostfile
ca223 slots=2
ca224 slots=2
First, let's do this with 2 slaves:
mpirun -v -np 1 --bind-to none --hostfile hostfile --mca mpi_warn_on_fork 0 --mca btl_openib_allow_ib 1 Rscript omp_test_S2.R
with this:
omp_test_S2.R
library("Rmpi")
#
# In case R exits unexpectedly, have it automatically clean up
# resources taken up by Rmpi (slaves, memory, etc...)
.Last <- function(){
    if (is.loaded("mpi_initialize")){
        if (mpi.comm.size(1) > 0){
            print("Please use mpi.close.Rslaves() to close slaves.")
            mpi.close.Rslaves()
        }
        print("Please use mpi.quit() to quit R")
        .Call("mpi_finalize")
    }
}
Sys.setenv(OMP_NUM_THREADS = 12 )
Sys.setenv(OMP_PROC_BIND = "false")
cat("show quick core spread on master","\n",sep="")
system("./openmp_example")
ns <- mpi.universe.size()
cat("mpi.universe.size = ",ns,"\n",sep="")
ns <- 2
mpi.spawn.Rslaves(nslaves=ns)
# Tell all slaves to return a message identifying themselves
mpi.bcast.cmd( id <- mpi.comm.rank() )
mpi.bcast.cmd( ns <- mpi.comm.size() )
mpi.bcast.cmd( host <- mpi.get.processor.name() )
mpi.bcast.cmd( Sys.setenv(OMP_NUM_THREADS = 12 ))
mpi.bcast.cmd( Sys.setenv(OMP_PROC_BIND = "false") )
mpi.remote.exec(paste("I am",mpi.comm.rank(),"of",mpi.comm.size()))
mpi.bcast.cmd(system('./openmp_example >>slave_S2'))
mpi.bcast.cmd(system('env >> slave_env_S2'))
##system("./openmp_example")
# Tell all slaves to close down, and exit the program
mpi.close.Rslaves(dellog = FALSE)
mpi.quit()
which outputs something like the following:
core spread from slaves
Host: ca223 : core: 4 , I am running process 0 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 2 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 3 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 4 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 5 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 6 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 7 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 8 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 9 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 1 out of 10 (max 12 ) with affinity -1
parallel for ends.
Host: ca224 : core: 0 , I am running process 0 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 2 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 3 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 4 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 5 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 6 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 7 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 8 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 9 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 1 out of 10 (max 12 ) with affinity -1
parallel for ends.
OK, so we are stuck on core 4 and core 0 on the slaves. Let's try 3 slaves: the same mpirun call, but with the R script spawning 3 slaves (the only change is shown below), and look at the output:
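The only change relative to omp_test_S2.R is the slave count, i.e.:
ns <- 3
mpi.spawn.Rslaves(nslaves=ns)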
core spread from slaves
Host: ca223 : core: 2 , I am running process 9 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 8 , I am running process 7 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 18 , I am running process 0 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 0 , I am running process 8 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 6 , I am running process 3 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 10 , I am running process 1 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 12 , I am running process 6 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 4 , I am running process 4 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 8 , I am running process 2 out of 10 (max 12 ) with affinity -1
Host: ca223 : core: 14 , I am running process 5 out of 10 (max 12 ) with affinity -1
parallel for ends.
Host: ca224 : core: 2 , I am running process 8 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 16 , I am running process 5 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 10 , I am running process 1 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 0 , I am running process 9 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 6 , I am running process 2 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 4 , I am running process 3 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 12 , I am running process 4 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 10 , I am running process 7 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 14 , I am running process 6 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 18 , I am running process 0 out of 10 (max 12 ) with affinity -1
parallel for ends.
Host: ca224 : core: 8 , I am running process 0 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 16 , I am running process 7 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 2 , I am running process 3 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 6 , I am running process 2 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 14 , I am running process 8 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 8 , I am running process 1 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 16 , I am running process 5 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 16 , I am running process 9 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 16 , I am running process 6 out of 10 (max 12 ) with affinity -1
Host: ca224 : core: 16 , I am running process 4 out of 10 (max 12 ) with affinity -1
parallel for ends.
So what is happening with two slaves?
A few things:
Run failed due to: pml_ucx.c:176 Error: Failed to receive UCX worker address
and I have not really tried turning UCX off in favour of ob1. Well, I gave it a quick try, but I could try a bit harder.
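(For reference, my understanding is that forcing ob1 instead of UCX would just be a matter of adding the PML selection to the same command line, something like:)
mpirun -v -np 1 --bind-to none --hostfile hostfile --mca pml ob1 --mca mpi_warn_on_fork 0 --mca btl_openib_allow_ib 1 Rscript omp_test_S2.R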
The --exclusive flag does not change the behaviour. Has anyone else seen this? I am not quite sure where to look for the problem. As far as I can tell it all works in the 3-slave case, so R and Rmpi seem fine; is this a SLURM-OpenMPI issue? Are there environment variables I should be looking at?
While I am asking questions: mpi.universe.size() now seems to match the configuration in SLURM, i.e. nodes=2, tasks=3 gives a size of 6. What if I simply want the number of nodes? Ideally that would be something I get from an MPI call rather than from ambiguous environment variables.
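A possible workaround (just a sketch, and I would still prefer a direct call) would be to ask each spawned slave for its processor name after mpi.spawn.Rslaves() and count the distinct hosts:
# one processor name per slave, then count the distinct hosts
hosts  <- unlist(mpi.remote.exec(mpi.get.processor.name()))
nnodes <- length(unique(hosts))
cat("slaves span ", nnodes, " nodes\n", sep = "")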
As a follow-up, I can see that --bind-to none is doing its job for -np 2:
mpirun -np 2 --bind-to none --hostfile hostfile --map-by ppr:1:node --mca mpi_warn_on_fork 0 --mca btl_openib_allow_ib 1 ./openmp_example
Host: ca207 : core: 2 , I am running process 8 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 0 , I am running process 9 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 14 , I am running process 0 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 8 , I am running process 2 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 6 , I am running process 3 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 4 , I am running process 4 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 10 , I am running process 1 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 12 , I am running process 6 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 0 , I am running process 5 out of 10 (max 8 ) with affinity -1
Host: ca207 : core: 8 , I am running process 7 out of 10 (max 8 ) with affinity -1
parallel for ends.
Host: ca208 : core: 2 , I am running process 9 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 0 , I am running process 8 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 14 , I am running process 0 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 6 , I am running process 3 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 10 , I am running process 1 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 4 , I am running process 4 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 12 , I am running process 6 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 2 , I am running process 5 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 8 , I am running process 2 out of 10 (max 8 ) with affinity -1
Host: ca208 : core: 8 , I am running process 7 out of 10 (max 8 ) with affinity -1
parallel for ends.
And the same command without --bind-to none, where everything lands on core 0:
mpirun -np 2 --hostfile hostfile --map-by ppr:1:node --mca mpi_warn_on_fork 0 --mca btl_openib_allow_ib 1 ./openmp_example
Host: ca207 : core: 0 , I am running process 0 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 1 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 0 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 2 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 4 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 8 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 2 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 3 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 9 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 5 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 7 out of 10 (max 1 ) with affinity -1
Host: ca207 : core: 0 , I am running process 6 out of 10 (max 1 ) with affinity -1
parallel for ends.
Host: ca208 : core: 0 , I am running process 3 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 4 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 5 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 6 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 7 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 8 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 9 out of 10 (max 1 ) with affinity -1
Host: ca208 : core: 0 , I am running process 1 out of 10 (max 1 ) with affinity -1