Question

我试图通过设置偏移来消耗主题中的数据但是得到断言错误 -

from kafka import KafkaConsumer

consumer = KafkaConsumer('foobar1',
                         bootstrap_servers=['localhost:9092'])
print 'process started'
print consumer.partitions_for_topic('foobar1')
print 'done'
consumer.seek(0,10)

for message in consumer:
    print ("%s:%d:%d: key=%s value=%s" % (message.topic, message.partition,
                                          message.offset, message.key,
                                          message.value))
print 'process ended'

错误： -

Traceback (most recent call last):
  File "/Users/pn/Documents/jobs/ccdn/kafka_consumer_1.py", line 21, in <module>
    consumer.seek(0,10)
  File "/Users/pn/.virtualenvs/vpsq/lib/python2.7/site-packages/kafka/consumer/group.py", line 549, in seek
    assert partition in self._subscription.assigned_partitions(), 'Unassigned partition'
AssertionError: Unassigned partition

Answer 1

在调用seek之前，您必须使用TopicPartitions列表调用consumer.assign（）。另请注意，seek的第一个参数也是TopicPartition。见KafkaConsumer API

Answer 2

在Kafka 0.9和kafka-python的情况下，分区分配在for message in consumer期间发生。因此，迭代后应该寻求操作。我通过以下代码重置了我的组的偏移量：

import kafka

ps = []
for i in xrange(topic_partition_number):
    ps.append(kafka.TopicPartition(topic, i))

consumer = kafka.KafkaConsumer(topic, bootstrap_servers=address, group_id=group)
for msg in consumer:
    print msg
    consumer.seek_to_beginning(*ps)
    consumer.commit()
    break

Answer 3

以下是解决此问题的示例：

from kafka import KafkaConsumer, TopicPartition

con = KafkaConsumer(bootstrap_servers = my_bootstrapservers)
tp = TopicPartition(my_topic, 0)
con.assign([tp])
con.seek_to_beginning()
con.seek(tp, 1000000)

参考： kafka consumer seek is not working: AssertionError: Unassigned partition

AssertionError：未分配的分区

3 个答案: