ProcessPoolExecutor: handling BrokenProcessPool

Date: 2018-10-02 22:41:31

Tags: python python-3.x concurrency future

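This documentation (https://pymotw.com/3/concurrent.futures/) says:

"The ProcessPoolExecutor works in the same way as ThreadPoolExecutor, but uses processes instead of threads. This allows CPU-intensive operations to use a separate CPU and not be blocked by the CPython interpreter's global interpreter lock."

Sounds great! It also says:

"If something happens to one of the worker processes to cause it to exit unexpectedly, the ProcessPoolExecutor is considered "broken" and will no longer schedule tasks."

This sounds bad :( So I guess my question is: what counts as "unexpectedly"? Does it just mean the exit signal is not 1? Can I safely exit the thread and still keep processing the queue? An example follows: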

    from concurrent import futures
    import os
    import signal


    with futures.ProcessPoolExecutor(max_workers=2) as ex:
        print('getting the pid for one worker')
        f1 = ex.submit(os.getpid)
        pid1 = f1.result()

        print('killing process {}'.format(pid1))
        os.kill(pid1, signal.SIGHUP)

        print('submitting another task')
        f2 = ex.submit(os.getpid)
        try:
            pid2 = f2.result()
        except futures.process.BrokenProcessPool as e:
            print('could not start new tasks: {}'.format(e))

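1 answer:

Answer 0 (score: 0)

I haven't seen this in real life, but from the code it looks like the pool is marked broken when the file descriptors returned by wait do not include the result_queue's file descriptor.

From concurrent.futures.process: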

    reader = result_queue._reader

    while True:
        _add_call_item_to_queue(pending_work_items,
                                work_ids_queue,
                                call_queue)

        sentinels = [p.sentinel for p in processes.values()]
        assert sentinels
        ready = wait([reader] + sentinels)
        if reader in ready:  # <===================================== THIS
            result_item = reader.recv()
        else:
            # Mark the process pool broken so that submits fail right now.
            executor = executor_reference()
            if executor is not None:
                executor._broken = True
                executor._shutdown_thread = True
                executor = None
            # All futures in flight must be marked failed
            for work_id, work_item in pending_work_items.items():
                work_item.future.set_exception(
                    BrokenProcessPool(
                        "A process in the process pool was "
                        "terminated abruptly while the future was "
                        "running or pending."
                    ))
                # Delete references to object. See issue16284
                del work_item
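
To make the distinction in the excerpt above concrete, here is a minimal sketch, not taken from the original answer. It assumes a POSIX system and explicitly requests the "fork" start method; the helper names raise_error, kill_self and probe_pool are hypothetical. An exception raised inside a task travels back through the result queue like any other result, so the pool stays usable. A worker that dies abruptly (SIGKILL is used here as one example of abrupt termination) only trips its sentinel, so the pending future fails with BrokenProcessPool and later submits are rejected.

```python
import multiprocessing as mp
import os
import signal
from concurrent.futures import ProcessPoolExecutor
from concurrent.futures.process import BrokenProcessPool


def raise_error():
    # An ordinary failure: the exception is pickled and sent back as the result.
    raise ValueError("ordinary failure inside a task")


def kill_self():
    # An abrupt exit: the worker dies without sending anything back.
    os.kill(os.getpid(), signal.SIGKILL)


def probe_pool():
    """Hypothetical helper: record how the pool reacts to each kind of failure."""
    ctx = mp.get_context("fork")  # assumption: POSIX, fork start method
    outcomes = {}
    with ProcessPoolExecutor(max_workers=1, mp_context=ctx) as ex:
        try:
            ex.submit(raise_error).result()
        except ValueError:
            # The pool survived; a follow-up task still runs.
            outcomes["after_exception"] = ex.submit(os.getpid).result() > 0

        try:
            ex.submit(kill_self).result()
        except BrokenProcessPool:
            outcomes["after_kill"] = "broken"

        try:
            ex.submit(os.getpid)
        except BrokenProcessPool:
            # Once broken, submit() itself raises immediately.
            outcomes["resubmit"] = "rejected"
    return outcomes


if __name__ == "__main__":
    print(probe_pool())
```

So "unexpected" here is not about a particular exit code: what matters is whether the worker's sentinel becomes ready without a result having arrived.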

The wait function is system-dependent, but assuming a Linux OS (from multiprocessing.connection, with all timeout-related code removed):

    def wait(object_list, timeout=None):
        '''
        Wait till an object in object_list is ready/readable.

        Returns list of those objects in object_list which are ready/readable.
        '''
        with _WaitSelector() as selector:
            for obj in object_list:
                selector.register(obj, selectors.EVENT_READ)

            while True:
                ready = selector.select(timeout)
                if ready:
                    return [key.fileobj for (key, events) in ready]
                else:
                    # some timeout code
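
The same check can be reproduced outside the executor. The following is a rough sketch under stated assumptions (POSIX, "fork" start method; worker and run_once are hypothetical names, and a plain Pipe stands in for the executor's result queue). It waits on a pipe reader plus a process sentinel and then applies the same `reader in ready` test as the excerpt from concurrent.futures.process.

```python
import multiprocessing as mp
import os
import signal
from multiprocessing.connection import wait


def worker(conn, crash):
    if crash:
        # Die before sending anything, like an abruptly terminated pool worker.
        os.kill(os.getpid(), signal.SIGKILL)
    conn.send("done")


def run_once(crash):
    """Hypothetical helper mirroring the executor's 'reader in ready' check."""
    ctx = mp.get_context("fork")  # assumption: POSIX, fork start method
    reader, writer = ctx.Pipe(duplex=False)
    proc = ctx.Process(target=worker, args=(writer, crash))
    proc.start()
    try:
        # The parent keeps its copy of the write end open, so the reader
        # only becomes ready when real data arrives, not on end-of-file.
        ready = wait([reader, proc.sentinel], timeout=30)
        if not ready:
            return ("timeout", None)
        if reader in ready:
            return ("result", reader.recv())
        # Only the sentinel fired: the process ended without a result.
        return ("broken", None)
    finally:
        proc.join()
        reader.close()
        writer.close()


if __name__ == "__main__":
    print(run_once(crash=False))
    print(run_once(crash=True))
```

In the healthy run the worker sends its result before it exits, so the reader is always among the ready objects even if the sentinel has fired too. In the crashing run only the sentinel is ready, which is the branch that marks the pool as broken.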