Running Kafka Python consumers in parallel threads

Date: 2018-06-11 23:52:06

Tags: python apache-kafka kafka-python

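I'm new to Python and Kafka. I have a script that should start three Kafka consumers, wait for messages from those consumers, and do some other things. At this point I don't even know whether I'm heading in the right direction, so any help would be appreciated.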

import threading

from kafka import KafkaConsumer


class MainClass():
    def do_something_before(self):
        # something is done here
        pass

    def start_consumer(self):
        consumer1_thread = threading.Thread(target=self.cons1, args=())
        consumer2_thread = threading.Thread(target=self.cons2, args=())
        consumer1_thread.daemon = True
        consumer2_thread.daemon = True
        consumer1_thread.start()
        consumer2_thread.start()

    def cons1(self):
        consumer = KafkaConsumer(bootstrap_servers='localhost:9092',
                                 auto_offset_reset='earliest')
        consumer.subscribe(['my-topic'])
        for message in consumer:
            print(message.value)

    def cons2(self):
        consumer = KafkaConsumer(bootstrap_servers='localhost:9092',
                                 auto_offset_reset='earliest')
        consumer.subscribe(['my2-topic'])
        for message in consumer:
            print(message.value)

    def keep_working(self):
        # something is done here
        pass

if __name__ == '__main__':
    g = MainClass()
    g.do_something_before()
    g.start_consumer()
    g.keep_working()

2 answers:

Answer 0 (score: 1)

I've added a python-kafka example with 2 consumers (basically two Python processes). You can find it on GitHub at https://github.com/Shubhamgorde/kafka-python-app.

I can't post the whole Python file because it's rather large.

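import ast
import binascii
from multiprocessing import Process

from kafka import KafkaConsumer

# performCreditOperation and performDebitOperation are not shown in this excerpt.


def consumeData(topic):
    try:
        # Message values arrive hex-encoded; decode them to UTF-8 text
        consumer = KafkaConsumer(
            topic,
            value_deserializer=lambda v: binascii.unhexlify(v).decode('utf-8'))
    except Exception as exc:
        print("Error!!", exc)
        return  # without a consumer there is nothing to iterate over
    for msg in consumer:
        # The text is a Python literal; element 2 selects credit ('C') or debit ('D')
        record = ast.literal_eval(msg.value)
        if record[2] == 'C':
            performCreditOperation(record)
        elif record[2] == 'D':
            performDebitOperation(record)


if __name__ == '__main__':
    # One process per topic, each with its own consumer
    t1 = Process(target=consumeData, args=('Credit_transac',))
    t2 = Process(target=consumeData, args=('Debit_transac',))
    t1.start()
    t2.start()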
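
The deserializer and `ast.literal_eval` call above imply a particular wire format: the value is the text of a Python literal, encoded as UTF-8 and then hex-encoded. A minimal sketch of that round trip follows. The `encode_record` helper and the record layout `['acct-1', 250, 'C']` are hypothetical and only serve to illustrate the decoding steps; the real producer and record fields live in the linked repository.

```python
import ast
import binascii


def encode_record(record):
    """Hypothetical producer-side counterpart: literal text -> UTF-8 -> hex bytes."""
    return binascii.hexlify(repr(record).encode('utf-8'))


def decode_record(raw):
    """Mirror of the consumer side: un-hex, decode to text, then parse the literal."""
    text = binascii.unhexlify(raw).decode('utf-8')
    return ast.literal_eval(text)


# Hypothetical record; element 2 plays the role of the 'C'/'D' flag.
raw = encode_record(['acct-1', 250, 'C'])
record = decode_record(raw)
print(record[2])
```

`ast.literal_eval` only accepts Python literals (strings, numbers, lists, tuples, dicts and so on), so a malformed payload raises an exception instead of executing code.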

Answer 1 (score: 0)

Here is my implementation. I hope you find it useful.

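# Uses the confluent-kafka client (not kafka-python)
from confluent_kafka import Consumer, KafkaError, KafkaException


class ConsumerThread: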
    def __init__(self, config, topics):
        self.config = config
        self.topics = topics

    def readData(self):
        consumer = Consumer(self.config)
        consumer.subscribe(self.topics)
        self.run(consumer)

    def process_msg(self, msg):
        print('Received message.')
        print('Key: {}, Val: {}'.format(msg.key(), msg.value()))
        print('Partition: {}, Offset: {}'.format(msg.partition(), msg.offset()))

    def run(self, consumer):
        try:
            while True:
                msg = consumer.poll(0.1)
                if msg is None:  # poll() returns None when the timeout expires
                    continue
                if msg.error():
                    if msg.error().code() == KafkaError._PARTITION_EOF:
                        # End of partition event
                        print('End of partition reached {0}/{1}'
                            .format(msg.topic(), msg.partition()))
                    else:
                        raise KafkaException(msg.error())
                else:
                    self.process_msg(msg)

        except KeyboardInterrupt:
            print("Detected Keyboard Interrupt. Cancelling.")

        finally:
            consumer.close()
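
The class above does not show how several poll loops would run side by side, so here is a minimal sketch of one way to do it with `threading`. Everything in it is an assumption for illustration: `FakeConsumer` is a hypothetical stand-in that mimics only `poll()` and `close()` so the example runs without a broker, and `poll_loop` and `run_consumers` are names invented here. Because `KeyboardInterrupt` is raised only in the main thread, the worker threads are stopped with a `threading.Event` instead.

```python
import queue
import threading


class FakeConsumer:
    """Hypothetical stand-in for a real client; only poll() and close() are mimicked."""

    def __init__(self, messages):
        self._messages = queue.Queue()
        for m in messages:
            self._messages.put(m)
        self.closed = False

    def poll(self, timeout):
        # Return the next message, or None if nothing arrives within the timeout.
        try:
            return self._messages.get(timeout=timeout)
        except queue.Empty:
            return None

    def close(self):
        self.closed = True


def poll_loop(consumer, stop_event, out_queue):
    """Poll until stop_event is set, handing each message to the main thread."""
    try:
        while not stop_event.is_set():
            msg = consumer.poll(0.05)
            if msg is None:
                continue
            out_queue.put(msg)
    finally:
        consumer.close()


def run_consumers(consumers, expected):
    """Start one thread per consumer and collect `expected` messages."""
    stop_event = threading.Event()
    out_queue = queue.Queue()
    threads = [threading.Thread(target=poll_loop, args=(c, stop_event, out_queue))
               for c in consumers]
    for t in threads:
        t.start()
    # The main thread stays free for other work; here it only drains the queue.
    received = [out_queue.get(timeout=5) for _ in range(expected)]
    stop_event.set()
    for t in threads:
        t.join()
    return received


consumers = [FakeConsumer(['a1', 'a2']), FakeConsumer(['b1'])]
received = run_consumers(consumers, expected=3)
print(sorted(received))
```

With a real client, each thread would build its own consumer and the loop would keep the `msg.error()` handling from the answer above. Joining the threads after setting the event lets every `finally` block run, so each consumer is closed before the program exits.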