使用Flume将数据从kafka提取到HDFS :: ConfigurationException:必须指定Bootstrap Server

时间:2018-03-12 11:25:30

标签: hadoop apache-kafka flume flume-ng

我正在尝试使用从kafka源到hdfs的flume来提取数据。下面是我的flume conf文件。

flume1.sources = kafka-source-1
flume1.channels = hdfs-channel-1
flume1.sinks = hdfs-sink-1
flume1.sources.kafka-source-1.type = org.apache.flume.source.kafka.KafkaSource
flume1.sources.kafka-source-1.bootstrap.servers = localhost:9092
flume1.sources.kafka-source-1.zookeeperConnect = localhost:2181
flume1.sources.kafka-source-1.topic = MYNEWSFEEDS
flume1.sources.kafka-source-1.batchSize = 100
flume1.sources.kafka-source-1.channels = hdfs-channel-1
flume1.channels.hdfs-channel-1.type   = memory
flume1.sinks.hdfs-sink-1.channel = hdfs-channel-1
flume1.sinks.hdfs-sink-1.type = hdfs
flume1.sinks.hdfs-sink-1.hdfs.writeFormat = Text
flume1.sinks.hdfs-sink-1.hdfs.fileType = DataStream
flume1.sinks.hdfs-sink-1.hdfs.filePrefix = test-events
flume1.sinks.hdfs-sink-1.hdfs.useLocalTimeStamp = true
flume1.sinks.hdfs-sink-1.hdfs.path = hdfs://quickstart.cloudera:8020/tmp
flume1.sinks.hdfs-sink-1.hdfs.rollCount=100
flume1.sinks.hdfs-sink-1.hdfs.rollSize=0
flume1.channels.hdfs-channel-1.capacity = 10000
flume1.channels.hdfs-channel-1.transactionCapacity = 1000

我使用以下命令运行水槽代理:

sudo flume-ng agent --name flume1 --conf-file '/etc/flume-ng/conf/flafka.conf' Dflume.root.logger=TRACE,console

但我收到以下错误:

  

18/03/12 16:49:18 ERROR node.AbstractConfigurationProvider:Source   由于配置期间出错,kafka-source-1已被删除   org.apache.flume.conf.ConfigurationException:Bootstrap服务器必须   在...指定   org.apache.flume.source.kafka.KafkaSource.doConfigure(KafkaSource.java:330)     在   org.apache.flume.source.BasicSourceSemantics.configure(BasicSourceSemantics.java:65)     在   org.apache.flume.source.AbstractPollableSource.configure(AbstractPollableSource.java:63)     在   org.apache.flume.conf.Configurables.configure(Configurables.java:41)     在   org.apache.flume.node.AbstractConfigurationProvider.loadSources(AbstractConfigurationProvider.java:326)     在   org.apache.flume.node.AbstractConfigurationProvider.getConfiguration(AbstractConfigurationProvider.java:97)     在   org.apache.flume.node.PollingPropertiesFileConfigurationProvider $ FileWatcherRunnable.run(PollingPropertiesFileConfigurationProvider.java:140)     在   java.util.concurrent.Executors $ RunnableAdapter.call(Executors.java:471)     at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:304)     在   java.util.concurrent.ScheduledThreadPoolExecutor中的$ ScheduledFutureTask.access $ 301(ScheduledThreadPoolExecutor.java:178)     在   java.util.concurrent.ScheduledThreadPoolExecutor中的$ ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)     在   java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)     在   java.util.concurrent.ThreadPoolExecutor中的$ Worker.run(ThreadPoolExecutor.java:615)     在java.lang.Thread.run(Thread.java:745)

虽然,我已经在conf文件中指定了Bootstrap服务器,但它仍然给出了同样的错误。尝试了许多排列组合但没有成功。

1 个答案:

答案 0 :(得分:3)

根据official JavaDoc,您应该替换

flume1.sources.kafka-source-1.bootstrap.servers = localhost:9092

flume1.sources.kafka-source-1.kafka.bootstrap.servers = localhost:9092