AssertionError: Unassigned partition

Time: 2016-04-03 03:30:47

Tags: apache-kafka kafka-consumer-api kafka-python

I am trying to consume data from a topic starting at a specific offset, but I get an assertion error:

from kafka import KafkaConsumer

consumer = KafkaConsumer('foobar1',
                         bootstrap_servers=['localhost:9092'])
print 'process started'
print consumer.partitions_for_topic('foobar1')
print 'done'
consumer.seek(0,10)

for message in consumer:
    print ("%s:%d:%d: key=%s value=%s" % (message.topic, message.partition,
                                          message.offset, message.key,
                                          message.value))
print 'process ended'

Error:

Traceback (most recent call last):
  File "/Users/pn/Documents/jobs/ccdn/kafka_consumer_1.py", line 21, in <module>
    consumer.seek(0,10)
  File "/Users/pn/.virtualenvs/vpsq/lib/python2.7/site-packages/kafka/consumer/group.py", line 549, in seek
    assert partition in self._subscription.assigned_partitions(), 'Unassigned partition'
AssertionError: Unassigned partition

3 answers:

Answer 0 (score: 1)

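Before calling seek, you must call consumer.assign() with a list of TopicPartitions. Also note that the first argument to seek is a TopicPartition as well, not a bare partition number. See the KafkaConsumer API.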
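The order described above can be sketched as a small helper. This is only an illustration: the name assign_and_seek is hypothetical, and the fallback namedtuple exists solely so the snippet can be read without kafka-python installed. It assumes the consumer was created without a topic subscription (for example KafkaConsumer(bootstrap_servers=['localhost:9092'])), since kafka-python does not allow mixing assign() with a subscription.

```python
try:
    from kafka import TopicPartition
except ImportError:
    # Fallback for reading/running the sketch without kafka-python installed;
    # it mirrors the (topic, partition) shape of kafka.TopicPartition.
    from collections import namedtuple
    TopicPartition = namedtuple("TopicPartition", ["topic", "partition"])


def assign_and_seek(consumer, topic, partition, offset):
    """Manually assign one partition, then move its fetch position to offset.

    Hypothetical helper: `consumer` is expected to behave like a
    kafka.KafkaConsumer that has NOT subscribed to any topic.
    """
    tp = TopicPartition(topic, partition)
    consumer.assign([tp])      # must happen before seek()
    consumer.seek(tp, offset)  # first argument is a TopicPartition, not an int
    return tp
```

With the values from the question, the call would be assign_and_seek(consumer, 'foobar1', 0, 10), after which iterating over the consumer should start at offset 10 of partition 0.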

Answer 1 (score: 0)

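With Kafka 0.9 and kafka-python, partition assignment happens during the for message in consumer loop. The seek operation should therefore come after iteration has started. I reset my group's offsets with the following code: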

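import kafka

# topic, address, group and topic_partition_number are placeholders for your own values.
ps = []
for i in xrange(topic_partition_number):
    ps.append(kafka.TopicPartition(topic, i))

consumer = kafka.KafkaConsumer(topic, bootstrap_servers=address, group_id=group)
for msg in consumer:
    print msg
    # Partitions are assigned once iteration has started, so seeking is allowed here.
    consumer.seek_to_beginning(*ps)
    consumer.commit()
    break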
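A variation on the same idea, for the case where the partition count is not known up front: poll until the group has handed out partitions, then seek each assigned one. This is a rough sketch, not a recommended recipe. The helper name seek_once_assigned and its max_polls/timeout_ms parameters are made up for illustration, and it assumes that poll() drives the group join, as it appears to in kafka-python. Any records returned by those polls are discarded.

```python
def seek_once_assigned(consumer, offset, max_polls=10, timeout_ms=1000):
    """Poll until partitions are assigned, then seek every one of them.

    Hypothetical helper: `consumer` is expected to behave like a subscribed
    kafka.KafkaConsumer (poll(), assignment() and seek() are used).
    """
    for _ in range(max_polls):
        # Assumption: poll() triggers the group join. Records fetched here
        # are dropped on purpose, because the position is reset below.
        consumer.poll(timeout_ms=timeout_ms)
        assigned = consumer.assignment()
        if assigned:
            for tp in assigned:
                consumer.seek(tp, offset)
            return set(assigned)
    raise RuntimeError("no partitions assigned after %d polls" % max_polls)
```

Compared with iterating and breaking after the first message, this does not block forever on an empty topic, because it gives up after max_polls attempts.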

Answer 2 (score: 0)

Here is an example that solves this problem:

from kafka import KafkaConsumer, TopicPartition

con = KafkaConsumer(bootstrap_servers = my_bootstrapservers)
tp = TopicPartition(my_topic, 0)
con.assign([tp])
con.seek_to_beginning()
con.seek(tp, 1000000)

Reference: kafka consumer seek is not working: AssertionError: Unassigned partition