如何在单独的进程中运行Python自定义对象,它们都在共享事件队列上工作?

时间:2019-01-08 12:02:40

标签: python multiprocessing

我有4个不同的Python自定义对象和一个事件队列。每个对象都有一种方法,允许它从共享事件队列中检索事件,如果类型是所需的事件,则对其进行处理,然后将新事件放在同一事件队列中,以允许其他进程对其进行处理。

这是一个例子。

import multiprocessing as mp

class CustomObject:

    def __init__(events_queue: mp.Queue) -> None:
        self.events_queue = event_queue

    def process_events_queue() -> None:
        event = self.events_queue.get()
        if type(event) == SpecificEventDataTypeForThisClass:
            # do something and create a new_event
            self.events_queue.put(new_event)
        else:
            self.events_queue.put(event)

    # there are other methods specific to each object

这4个对象有特定的任务要做,但是它们都共享相同的结构。由于我需要“模拟”生产条件,因此我希望它们同时独立于彼此运行。

如果可能的话,这里只是我想做的一个例子。

import multiprocessing as mp
import CustomObject

if __name__ == '__main__':

    events_queue = mp.Queue()

    data_provider = mp.Process(target=CustomObject, args=(events_queue,))
    portfolio = mp.Process(target=CustomObject, args=(events_queue,))
    engine = mp.Process(target=CustomObject, args=(events_queue,))
    broker = mp.Process(target=CustomObject, args=(events_queue,))

    while True:
        data_provider.process_events_queue()
        portfolio.process_events_queue()
        engine.process_events_queue()
        broker.process_events_queue()

我的想法是在一个单独的进程中运行每个对象,使它们可以与通过events_queue共享的事件进行通信。所以我的问题是,我该怎么办?

问题在于obj = mp.Process(target=CustomObject, args=(events_queue,))返回一个Process实例,而我无法从中访问CustomObject方法。另外,有没有更聪明的方法来实现我想要的?

1 个答案:

答案 0 :(得分:2)

进程需要运行一个函数,该函数定义了进程实际在做什么。一旦该函数退出(并且没有非守护进程线程),该过程就完成了。这类似于Python本身始终执行__main__脚本的方式。

如果您执行mp.Process(target=CustomObject, args=(events_queue,)),则仅告诉该过程调用CustomObject-将其实例化一次,然后完成。这不是您想要的,除非类在实例化时实际执行了工作-出于其他原因,这是一个坏主意。

相反,您必须定义一个满足您需要的主要功能或方法:“与通过events_queue共享的事件进行通信”。此功能应侦听队列并根据收到的事件采取措施。

一个简单的实现如下所示:

import os, time
from multiprocessing import Queue, Process


class Worker:
    # separate input and output for simplicity
    def __init__(self, commands: Queue, results: Queue):
        self.commands = commands
        self.results = results

    # our main function to be run by a process
    def main(self):
        # each process should handle more than one command
        while True:
            value = self.commands.get()
            # pick a well-defined signal to detect "no more work"
            if value is None:
                self.results.put(None)
                break
            # do whatever needs doing
            result = self.do_stuff(value)
            print(os.getpid(), ':', self, 'got', value, 'put', result)
            time.sleep(0.2)  # pretend we do something
            # pass on more work if required
            self.results.put(result)

    # placeholder for what needs doing
    def do_stuff(self, value):
        raise NotImplementedError

这是仅处理事件的类的模板。 do_stuff方法必须重载以定义实际发生的情况。

class AddTwo(Worker):
    def do_stuff(self, value):
        return value + 2


class TimesThree(Worker):
    def do_stuff(self, value):
        return value * 3


class Printer(Worker):
    def do_stuff(self, value):
        print(value)

这已经定义了完全有效的进程有效负载:Process(target=TimesThree(in_queue, out_queue).main)在进程中调度main方法,侦听并响应命令。

运行此程序主要需要连接各个组件:

if __name__ == '__main__':
    # bookkeeping of resources we create
    processes = []
    start_queue = Queue()
    # connect our workers via queues
    queue = start_queue
    for element in (AddTwo, TimesThree, Printer):
        instance = element(queue, Queue())
        # we run the main method in processes
        processes.append(Process(target=instance.main))
        queue = instance.results
    # start all processes
    for process in processes:
        process.start()
    # send input, but do not wait for output
    start_queue.put(1)
    start_queue.put(248124)
    start_queue.put(-256)
    # send shutdown signal
    start_queue.put(None)
    # wait for processes to shutdown
    for process in processes:
        process.join()

请注意,您不需要为此的类。只要所有内容都是可腌制的,您还可以编写具有类似效果的函数:

import os, time
from multiprocessing import Queue, Process

def main(commands, results, do_stuff):
    while True:
        value = commands.get()
        if value is None:
            results.put(None)
            break
        result = do_stuff(value)
        print(os.getpid(), ':', do_stuff, 'got', value, 'put', result)
        time.sleep(0.2)
        results.put(result)

def times_two(value):
    return value * 2


if __name__ == '__main__':
    in_queue, out_queue = Queue(), Queue()
    worker = Process(target=main, args=(in_queue, out_queue, times_two))
    worker.start()
    for message in (1, 3, 5, None):
        in_queue.put(message)
    while True:
        reply = out_queue.get()
        if reply is None:
            break
        print('result:', reply)