MPI Slurm exit code 254?

Date: 2015-05-18 17:45:34

Tags: c mpi slurm

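I am trying to run a C program with MPI. I am still fairly new to MPI, so I am not sure what is going wrong. I want to time how long one function takes so that I can measure the speedup over the serial version. The timing code is as follows: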

int my_rank;
MPI_Init(NULL, NULL);
MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);  // my_rank must be set before it is tested below
double start, elapsed, local_elapsed;
int iters;

// compute page rank, timing the solve on each rank
start = MPI_Wtime();
iters = solve(S, C, d, y0, y, eps, strawman_mvpSM);
local_elapsed = MPI_Wtime() - start;
// rank 0 receives the largest per-rank time in elapsed
MPI_Reduce(&local_elapsed, &elapsed, 1, MPI_DOUBLE, MPI_MAX, 0, MPI_COMM_WORLD);
if (my_rank == 0) {
    printf("final (page rank) probabilities, %d iterations in time %f\n", iters, elapsed);
    printvec(y, n); printf("\n");
}
MPI_Finalize();
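
To make the intended measurement concrete, here is a minimal sketch in plain C (no MPI required) of what the timing code is meant to produce: the parallel time is the largest per-rank elapsed time, which is what the MPI_MAX reduction delivers to rank 0, and the speedup is the serial time divided by that value. The helper names max_elapsed, speedup and efficiency are hypothetical and are not part of the program above.

```c
#include <assert.h>
#include <stddef.h>

/* Largest per-rank elapsed time: the value an MPI_MAX reduction
 * over local_elapsed would deliver to the root rank. */
double max_elapsed(const double *local_elapsed, size_t n)
{
    assert(n > 0);
    double max = local_elapsed[0];
    for (size_t i = 1; i < n; ++i) {
        if (local_elapsed[i] > max)
            max = local_elapsed[i];
    }
    return max;
}

/* Speedup of the parallel run relative to the serial run. */
double speedup(double t_serial, double t_parallel)
{
    return t_serial / t_parallel;
}

/* Parallel efficiency: speedup divided by the number of processes. */
double efficiency(double t_serial, double t_parallel, int nprocs)
{
    return speedup(t_serial, t_parallel) / nprocs;
}
```

With two ranks, the value passed as t_parallel would be max_elapsed applied to both ranks' local_elapsed values, and t_serial would come from a separate run of the serial version.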

I compile the code with

mpicc -g -Wall -std=c99 -o pagerank-mpi pagerank-mpi.c

and run it with

srun -n 2 ./pagerank-mpi 1000

When I run it, I get the following output:

[0] Abort: free_vbuf_head = NULL at line 175 in file vbuf.c
[1] Abort: free_vbuf_head = NULL at line 175 in file vbuf.c
initial probabilities
(0.001, 0.001, 0.001, 0.001, 0.001, 0.001,
0.001, 0.001, 0.001, 0.001, 0.001, 0.001, ), sum is 1
dimension is 1000, nnz is 9097, damper is 0.850000, epsilon is 1e-09

initial probabilities
(0.001, 0.001, 0.001, 0.001, 0.001, 0.001,
0.001, 0.001, 0.001, 0.001, 0.001, 0.001, ), sum is 1
dimension is 1000, nnz is 9097, damper is 0.850000, epsilon is 1e-09

srun: error: node25: tasks 0-1: Exited with exit code 254

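I can't find what exit code 254 means, and I don't know what vbuf.c is. The program never prints the final probabilities, but it prints the initial probabilities twice, which it should not do. My main concern is the Abort lines and the exit code. Can anyone explain what they mean and why they occur?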
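
One general point about the exit code: on POSIX systems the parent process sees only the low 8 bits of the value a child passes to exit(), so a status of 254 is also what a process that exited with -2 would report. Whether that is what happens inside the MPI library here is an assumption; the output above does not confirm it. A minimal sketch of the wrap-around, using a hypothetical helper observed_exit_status:

```c
#define _POSIX_C_SOURCE 200809L
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: fork a child that exits with `code` and return the
 * exit status the parent observes. Returns -1 if fork or waitpid fails,
 * or if the child did not terminate normally. */
int observed_exit_status(int code)
{
    pid_t pid = fork();
    if (pid < 0)
        return -1;
    if (pid == 0)
        _exit(code);              /* child: terminate immediately */

    int status = 0;
    if (waitpid(pid, &status, 0) != pid)
        return -1;
    if (!WIFEXITED(status))
        return -1;
    return WEXITSTATUS(status);   /* low 8 bits of the child's exit value */
}
```

On a POSIX system, observed_exit_status(-2) returns 254, so the number alone does not say whether the process exited with 254 or with a negative code.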

0 answers:

No answers yet