我们在第一步中无法构建流式多维数据集:从kafka保存数据,这是输出日志:
Counters: 12
Job Counters
Failed map tasks=4
Launched map tasks=4
Other local map tasks=3
Rack-local map tasks=1
Total time spent by all maps in occupied slots (ms)=10836
Total time spent by all reduces in occupied slots (ms)=0
Total time spent by all map tasks (ms)=10836
Total vcore-seconds taken by all map tasks=10836
Total megabyte-seconds taken by all map tasks=11096064
Map-Reduce Framework
CPU time spent (ms)=0
Physical memory (bytes) snapshot=0
Virtual memory (bytes) snapshot=0
谁能告诉我如何解决它?
答案 0 :(得分:0)
当我运行kylin流多维数据集构建时,遇到了同样的问题。当我打开纱线日志时,我看到了
错误:java.lang.ClassNotFoundException:org.apache.kafka.clients.consumer.Consumer在java.net.URLClassLoader.findClass(URLClassLoader.java:381)在java.lang.ClassLoader.loadClass(ClassLoader.java: 424),位于org.apache.kylin.source.kafka.hadoop.KafkaInputFormat.createRecordReader,位于sun.misc.Launcher $ AppClassLoader.loadClass(Launcher.java:331),位于java.lang.ClassLoader.loadClass(ClassLoader.java:357), (KafkaInputFormat.java:107)在org.apache.hadoop.mapred.MapTask $ NewTrackingRecordReader。(MapTask.java:515)在org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:758)在org.apache org.apache.hadoop.mapred.YarnChild $ 2.run(YarnChild.java:164)处的java.security.AccessController.doPrivileged(本机方法)处的.hadoop.mapred.MapTask.run(MapTask.java:341)。在org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1758)处的security.auth.Subject.doAs(Subject.java:422)在org.apache.hadoop.mapred.YarnChild.main(YarnChild.java: 158)`
按照说明here,我设置了KAFKA_HOME
,然后重新启动了kylin
。没有这样的日志。