为什么multiprocessing.pool的这种实现不起作用?

时间:2014-07-14 00:15:49

标签: python multithreading numpy multiprocessing sympy

以下是我正在使用的代码:

def initFunction(arg1, arg2):
    def funct(value):
        return arg1 * arg2 * value
    return funct

os.system("taskset -p 0xff %d" % os.getpid()) 
pool = Pool(processes=4)
t = np.linspace(0,1,10e3)

a,b,c,d,e,f,g,h = sy.symbols('a,b,c,d,e,f,g,h',commutative=False)

arg1 = sy.Matrix([[a,b],[c,d]])
arg2 = sy.Matrix([[e,f],[g,h]])
myFunct = initFunction(arg1, arg2)

m3 = map(myFunct,t) # this works
m4 = pool.map(myFunct,t) # this does NOT work

我得到的错误是:

Traceback (most recent call last):
   File "<stdin>", line 1, in <module>
   File "/usr/lib/python2.7/dist-packages/spyderlib/widgets/externalshell/sitecustomize.py", line 540, in runfile
      execfile(filename, namespace)
   File "/home/justin/Research/mapTest.py", line 46, in <module>
      m4 = pool.map(myFunct,t) 
   File "/usr/lib/python2.7/multiprocessing/pool.py", line 251, in map
      return self.map_async(func, iterable, chunksize).get()
   File "/usr/lib/python2.7/multiprocessing/pool.py", line 558, in get
      raise self._value
cPickle.PicklingError: Can't pickle <type 'function'>: attribute lookup __builtin__.function failed

那么这个错误意味着什么?如何对这个地图函数进行多处理呢?

1 个答案:

答案 0 :(得分:7)

使用multiprocessing时在流程之间传递的对象必须可从子级中的__main__模块so that they can be unpickled导入。嵌套函数(如funct)无法从__main__导入,因此您会收到该错误。您可以使用functools.partial代替

来实现您的目标
from multiprocessing import Pool
from functools import partial

def funct(arg1, arg2, value):
    return arg1 * arg2 * value


if __name__ == "__main__":
    t = [1,2,3,4]
    arg1 = 4 
    arg2 = 5 

    pool = Pool(processes=4)
    func = partial(funct, arg1, arg2)
    m4 = pool.map(func,t)
    print(m4)

输出:

[20, 40, 60, 80]