我正在使用:https://github.com/mumrah/kafka-python作为Python中的kafka api。我想获取指定主题的分区数。我该怎么做?
答案 0 :(得分:8)
可能是一个稍微简单的解决方案,但是:
from kafka import KafkaClient
client = KafkaClient('SERVER:PORT')
topic_partition_ids = client.get_partition_ids_for_topic(b'TOPIC')
len(topic_partition_ids)
在Python 3.4.3 / kafka-python 0.9.3
上测试答案 1 :(得分:2)
我在试图解决这个完全相同的问题时发现了这个问题。我知道问题已经过时了,但这是我提出的解决方案(使用Kazoo与zookeeper交谈):
from kazoo.client import KazooClient
class KafkaInfo(object):
def __init__(self, hosts):
self.zk = KazooClient(hosts)
self.zk.start()
def topics(self):
return self.zk.get_children('/brokers/topics')
def partitions(self, topic):
strs = self.zk.get_children('/brokers/topics/%s/partitions' % topic)
return map(int, strs)
def consumers(self):
return self.zk.get_children('/consumers')
def topics_for_consumer(self, consumer):
return self.zk.get_children('/consumers/%s/offsets' % consumer)
def offset(self, topic, consumer, partition):
(n, _) = self.zk.get('/consumers/%s/offsets/%s/%d' % (consumer, topic, partition))
return int(n)
答案 2 :(得分:0)
对于那些使用Confluent-Python或企业API的用户。这可以通过以下方式完成:
def count_partitions(my_partitions) -> int:
count = 0
for part in my_partitions:
count = count + 1
return count
cluster_data: ClusterMetadata = producer.list_topics(topic=TOPIC)
topic_data: TopicMetadata = cluster_data.topics[TOPIC]
available_partitions: PartitionMetadata = topic_data.partitions
print(count_partitions(available_partitions))
答案 3 :(得分:0)
Python 3.8.10/kafka-python 2.0.2 解决方案:
from kafka import KafkaConsumer
def get_partitions_number(server, topic):
consumer = KafkaConsumer(
topic,
bootstrap_servers=server
)
partitions = consumer.partitions_for_topic(topic)
return len(partitions)