Spark 1.5.1独立集群 - 错误的Akka远程配置?

时间:2015-10-08 09:20:39

标签: apache-spark akka akka-remote-actor

使用Spark做我的第一步,我遇到了从应用程序代码向集群提交作业的问题。挖掘日志,我注意到主日志上有一些定期的WARN消息:

15/10/08 13:00:00 WARN remote.ReliableDeliverySupervisor: Association with remote system [akka.tcp://sparkDriver@192.168.254.167:64014] has failed, address is now gated for [5000] ms. Reason: [Disassociated]

问题是我们的网络上不存在IP地址,并且没有在任何地方配置。当它尝试执行任务时,工作日志上显示相同的错误ip(错误的ip传递给--driver-url):

15/10/08 12:58:21 INFO worker.ExecutorRunner: Launch command: "/usr/java/latest//bin/java" "-cp" "/path/spark/spark-1.5.1-bin-ha
doop2.6/sbin/../conf/:/path/spark/spark-1.5.1-bin-hadoop2.6/lib/spark-assembly-1.5.1-hadoop2.6.0.jar:/path/spark/
spark-1.5.1-bin-hadoop2.6/lib/datanucleus-api-jdo-3.2.6.jar:/path/spark/spark-1.5.1-bin-hadoop2.6/lib/datanucleus-rdbms-3.2.9.ja
r:/path/spark/spark-1.5.1-bin-hadoop2.6/lib/datanucleus-core-3.2.10.jar:/path/hadoop/2.6.0//etc/hadoop/" "-Xms102
4M" "-Xmx1024M" "-Dspark.driver.port=64014" "-Dspark.driver.port=53411" "org.apache.spark.executor.CoarseGrainedExecutorBackend" "--driver-url"
"akka.tcp://sparkDriver@192.168.254.167:64014/user/CoarseGrainedScheduler" "--executor-id" "39" "--hostname" "192.168.10.214" "--cores" "16" "--app-id"  "app-20151008123702-0003" "--worker-url" "akka.tcp://sparkWorker@192.168.10.214:37625/user/Worker"
15/10/08 12:59:28 INFO worker.Worker: Executor app-20151008123702-0003/39 finished with state EXITED message Command exited with code 1 exitStatus 1

知道我做错了什么以及如何解决这个问题?

java版本是1.8.0_20,我使用预先构建的Spark二进制文件。

谢谢!

1 个答案:

答案 0 :(得分:0)

也许它会在my answer to a similar question为您提供一些线索,这与您的相似问题" 与远程系统的关联失败"