Question

我在C API中有double A[B_ROWS][B_COLUMNS];我使用了类似的东西：

MPI_Isend(&A[low_bound][0], (upper_bound - low_bound) * A_COLUMNS, MPI_DOUBLE, i, MASTER_TO_SLAVE_TAG + 2, MPI_COMM_WORLD, &request);

和

 MPI_Recv(&A[low_bound][0], (upper_bound - low_bound) * A_COLUMNS, MPI_DOUBLE, 0, MASTER_TO_SLAVE_TAG + 2, MPI_COMM_WORLD, &status);

现在with boost::mpi我试试：

world.isend(i, TO_SLAVE_TAG + 2, &A[low_bound][0], (upper_bound - low_bound) * A_COLUMNS);

和

world.recv(0, TO_SLAVE_TAG + 2, &A[low_bound][0], (upper_bound - low_bound) * A_COLUMNS);

但我的应用程序经常失败，例如：

rank 1 in job 10  master_39934   caused collective abort of all ranks
  exit status of rank 1: killed by signal 11

这意味着seg fault，请注意原始C应用程序根据需要工作，我目前所有更改的都是使用api - 而不是任何逻辑。

那么在boost :: mpi上发送2d C样式数组的正确方法是什么？

Answer 1

假设我的盲猜是正确的，并且您输入的内容是准确的，A的大小与A_COLUMNS无关（相反，A有B_COLUMNS }）。如果是这样，下面的代码将修复那种“不同步”错误：

template<typename World, typename T>
void isend( World& w, int dest, int tag, T const* t, size_t n = 1) {
  world.isend(dest, tag, &t, n);
}
template<typename World, typename T, size_t aSize>
void isend( World& w, int dest, int tag, T const (*arr1)[aSize], size_t n = 1) {
  world.isend(dest, tag, &(*arr)[0], n*aSize);
}
template<typename World, typename T, size_t aSize, size_t bSize>
void isend( World& w, int dest, int tag, T const (*arr2)[aSize][bSize], size_t n = 1) {
  world.isend(dest, tag, &(*arr)[0][0], n*aSize*bSize);
}

template<typename World, typename T>
void recv( World& w, int dest, int tag, T* t, size_t n = 1) {
  world.recv(dest, tag, &t, n);
}
template<typename World, typename T, size_t aSize>
void recv( World& w, int dest, int tag, T (*arr1)[aSize], size_t n = 1) {
  world.recv(dest, tag, &(*arr)[0], n*aSize);
}
template<typename World, typename T, size_t aSize, size_t bSize>
void recv( World& w, int dest, int tag, T (*arr2)[aSize][bSize], size_t n = 1) {
  world.recv(dest, tag, &(*arr)[0][0], n*aSize*bSize);
}

对于一维和二维数组，上面的代码将确定你真正想要发送的T的副本数量，而不是你必须手动维护它。

它甚至适用于切片，例如&A[low_bound], upper_bound-lower_bound。

您可能需要注意的一件事就是超越阵列的末尾。很可能你的C代码已经超过了数组的末尾，但那里没有任何重要的东西，所以它幸免于难。在C ++代码中，你可以在那里拥有一个对象，而你却死了而不是生存。

另一种方法可能是编写一个同时获取切片上限和下限的函数，如下所示：

template<typename World, typename T, size_t N>
void isend_slice( World& w, int dest, int tag, T const (&t)[N], size_t start=0, size_t end=N ) {
  Assert( end <= N && start < end );
  isend(world, dest, tag, &(t[start]), end-start);
}
template<typename World, typename T, size_t N>
void recv_slice( World& w, int dest, int tag, T (&t)[N], size_t start=0, size_t end=N ) {
  Assert( end <= N && start < end );
  recv(world, dest, tag, &(t[start]), end-start);
}

在这种情况下，您直接传递一个数组，然后说出您想要开始和结束阅读的位置。优点是我检查数组实际上是否有要发送的数据，或者是数据到达的空间。

（这两个函数依赖于上面的函数）

在分布式情况下，您需要为Asserts生成描述性的日志记录机制。

以下是上述代码的示例用法：

int array[10];
int array2[10][10];
isend(world, dest, tag+0, &int(7)); // tag is an int
isend(world, dest, tag+1, &array); // tag+1 is a 10 int array
isend(world, dest, tag+2, &array2); // tag+2 is a 100 int array
isend(world, dest, tag+1, &(array2[5])); // tag+1 is a 10 int array
isend_slice(world, tag+3, 0, array2, 7, 11); // asserts, but its a 40 int array

和recv。同上。

如何通过Boost :: MPI发送2d C样式数组？

1 个答案: