我正在编写一个mpi python代码。例如,四个过程有以下数据:
data on procs0: [1, 2, 3, 0, 0, 0, 0, 0, 0, 0, 0, 0]
data on procs1: [0, 0, 0, 4, 5, 6, 0, 0, 0, 0, 0, 0]
data on procs2: [0, 0, 0, 0, 0, 0, 7, 8, 9, 0, 0, 0]
data on procs3: [0, 0, 0, 0, 0, 0, 0, 0, 0, 10, 11, 12]
我想在mpi4py库中使用reduce函数来减少procs0上的数据,结果如下:
result on procs0: [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12]
如何使用mpi4py lib函数创建它?
编辑: 以上是一个简单的特例,设置不能使用,请看下面的另一个案例:
data on procs0: [1,0,0,0,0,0]
data on procs1: [0,2,0,0,0,0]
data on procs2: [0,0,0,3,0,0]
data on procs3: [0,0,0,0,4,5]
理想的结果必须是:
result on procs0: [1,2,0,3,4,5]
答案 0 :(得分:5)
我不确定你的问题是否需要数据总和或最大值。我用mpi Reduce
函数编写了一个简单的例子,它计算总和。
#!/usr/bin/env python
import numpy as np
from mpi4py import MPI
comm = MPI.COMM_WORLD
comm.Barrier()
t_start = MPI.Wtime()
# this array lives on each processor
data = np.zeros(5)
for i in xrange(comm.rank, len(data), comm.size):
# set data in each array that is different for each processor
data[i] = i
# print out the data arrays for each processor
print '[%i]'%comm.rank, data
comm.Barrier()
# the 'totals' array will hold the sum of each 'data' array
if comm.rank==0:
# only processor 0 will actually get the data
totals = np.zeros_like(data)
else:
totals = None
# use MPI to get the totals
comm.Reduce(
[data, MPI.DOUBLE],
[totals, MPI.DOUBLE],
op = MPI.SUM,
root = 0
)
# print out the 'totals'
# only processor 0 actually has the data
print '[%i]'%comm.rank, totals
comm.Barrier()
t_diff = MPI.Wtime() - t_start
if comm.rank==0: print t_diff
将此代码保存在文件reduce_test.py
中并使用命令mpirun -np 3 ./reduce_test.py
运行它在我的机器上输出以下内容:
[0] [ 0. 0. 0. 3. 0.]
[1] [ 0. 1. 0. 0. 4.]
[2] [ 0. 0. 2. 0. 0.]
[1] None
[2] None
[0] [ 0. 1. 2. 3. 4.]
0.00260496139526
请注意,将op = MPI.SUM
调用中的参数comm.Reduce
更改为op = MPI.MAX
将计算最大值而非总和。
答案 1 :(得分:0)
将list comprehension与zip一起使用,存储每列的最大值
In [1]: procs0=[1, 2, 3, 0, 0, 0, 0, 0, 0, 0, 0, 0]
In [2]: procs1=[0, 0, 0, 4, 5, 6, 0, 0, 0, 0, 0, 0]
In [3]: procs2=[0, 0, 0, 0, 0, 0, 7, 8, 9, 0, 0, 0]
In [4]: procs3=[0, 0, 0, 0, 0, 0, 0, 0, 0, 10, 11, 12]
In [5]: [max(i) for i in zip(procs0, procs1, procs2, procs3)]
Out[5]: [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12]