Reading multiple files using MPI-IO

Time: 2012-04-21 06:46:16

Tags: c file-io matrix io mpi

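I am trying to read multiple files using MPI-IO in C. I am following this example: http://users.abo.fi/Mats.Aspnas/PP2010/examples/MPI/readfile1.c

However, I am reading doubles into a matrix rather than a string of characters. Here is the implementation: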

/*
Simple MPI-IO program that demonstrates parallel reading from a file.
Compile the program with 'mpicc -O2 readfile1.c -o readfile1'
*/

#include <stdlib.h>
#include <stdio.h>
#include "mpi.h"

#define FILENAME "filename.dat"

double** ArrayAllocation() {
    int i;
    double** array2D;
    array2D= (double**) malloc(num_procs*sizeof(double*));
    for(i = 0; i < num_procs; i++) {
        array2D[i] = (double*) malloc(column_size*sizeof(double));
    }
    return array2D;
}

int main(int argc, char* argv[]) {
  int i, np, myid;
  int bufsize, nrchar;
  double *buf;          /* Buffer for reading */
  double **matrix = ArrayAllocation();
  MPI_Offset filesize;
  MPI_File myfile;    /* Shared file */ 
  MPI_Status status;  /* Status returned from read */

  /* Initialize MPI */
  MPI_Init(&argc, &argv);
  MPI_Comm_rank(MPI_COMM_WORLD, &myid);
  MPI_Comm_size(MPI_COMM_WORLD, &np);

  /* Open the files */
  MPI_File_open (MPI_COMM_WORLD, FILENAME, MPI_MODE_RDONLY,
         MPI_INFO_NULL, &myfile);

  /* Get the size of the file */
  MPI_File_get_size(myfile, &filesize);
  /* Calculate how many elements that is */
  filesize = filesize/sizeof(double);
  /* Calculate how many elements each processor gets */
  bufsize = filesize/np;
  /* Allocate the buffer to read into */
  buf = (double *) malloc((bufsize)*sizeof(double));
  /* Set the file view */
  MPI_File_set_view(myfile, myid*bufsize*sizeof(double), MPI_DOUBLE,
             MPI_DOUBLE,"native", MPI_INFO_NULL);
  /* Read from the file */
  MPI_File_read(myfile, buf, bufsize, MPI_DOUBLE, &status);
  /* Find out how many elements were read */
  MPI_Get_count(&status, MPI_DOUBLE, &nrchar);
  /* Set terminating null char in the string */
  //buf[nrchar] = (double)0;
  printf("Process %2d read %d characters: ", myid, nrchar);

  int j;
  for (j = 0; j <bufsize;j++){
    matrix[myid][j] = buf[j];
  }

  /* Close the file */
  MPI_File_close(&myfile);

  if (myid==0) {
    printf("Done\n");
  }

  MPI_Finalize();
  exit(0);
}

But when I try to call MPI_File_open after closing the first file, I get an error. Do I need multiple communicators to do this? Any hints would be appreciated.

1 Answer:

Answer 0: (score: 1)

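The code in ArrayAllocation above does not quite match the logic of the main program. The matrix is allocated as an array of pointers to vectors of doubles before MPI is initialized, so the number of rows cannot yet be set to the number of MPI processes.

column_size is also unknown until the file size has been determined.

The general convention in C is to store matrices row by row. Violating this convention may confuse you or the readers of your code.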

In summary, to make this program work, you need to declare

 int num_procs, column_size;

as global variables before the definition of ArrayAllocation, and move the call to this function below the line that computes bufsize:

 ...
 /* Calculate how many elements each processor gets */
 bufsize = filesize/np;

 num_procs = np;
 column_size = bufsize;
 double **matrix = ArrayAllocation();
 ...

With the above modifications, this example should work with any MPI implementation that supports MPI-IO. I tested it with OpenMPI 1.2.8.

To generate a test file, you can use the following code:

 int i;
 FILE* f = fopen(FILENAME, "wb");  /* binary mode: raw doubles, not text */
 double x = 0;
 for(i=0;i<100;i++){
   fwrite(&x, sizeof(double), 1, f);  /* one element of sizeof(double) bytes */
   x +=0.1;
 }
 fclose(f);