YARN ResourceTrackerService在STARTED状态下失败

时间:2014-11-02 21:41:50

标签: hadoop yarn

我正在尝试在共享磁盘上使用Hadoop Directory的几台计算机上设置hadoop集群。 HDFS运作良好。但是当我尝试启动YARN时,ResourceTracker会抛出一个BindException。资源跟踪器配置为运行的节点(ahti.d.umn.edu-131.212.41.9)是可访问的(我可以SSH到它),端口(28025)也是打开的。

org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService failed in state STARTED; cause: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.net.BindException: Problem binding to [ahti.d.umn.edu:28025] java.net.BindException: Cannot assign requested address; For more details see:  http://wiki.apache.org/hadoop/BindException
    org.apache.hadoop.yarn.exceptions.YarnRuntimeException: java.net.BindException: Problem binding to [ahti.d.umn.edu:28025] java.net.BindException: Cannot assign requested address; For more details see:  http://wiki.apache.org/hadoop/BindException
        at org.apache.hadoop.yarn.factories.impl.pb.RpcServerFactoryPBImpl.getServer(RpcServerFactoryPBImpl.java:139)
        at org.apache.hadoop.yarn.ipc.HadoopYarnProtoRPC.getServer(HadoopYarnProtoRPC.java:65)
        at org.apache.hadoop.yarn.ipc.YarnRPC.getServer(YarnRPC.java:54)
        at org.apache.hadoop.yarn.server.resourcemanager.ResourceTrackerService.serviceStart(ResourceTrackerService.java:159)
        at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
        at org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:120)
        at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$RMActiveServices.serviceStart(ResourceManager.java:503)
        at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
        at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startActiveServices(ResourceManager.java:898)
        at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$1.run(ResourceManager.java:938)
        at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$1.run(ResourceManager.java:935)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1614)
        at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.transitionToActive(ResourceManager.java:935)
        at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:979)
        at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
        at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:1104)

以下是我的yarn-site.xml

<configuration>
<property>
    <name>yarn.resourcemanager.hostname</name>
    <value>131.212.41.9</value>
</property>

<property>
 <name>yarn.resourcemanager.resource-tracker.address</name>       
 <value>131.212.41.9:28025</value>  
</property>

<property>       
 <name>yarn.resourcemanager.scheduler.address</name>       
 <value>131.212.41.9:8030</value>  
</property>

<property>       
 <name>yarn.resourcemanager.address</name>       
 <value>131.212.41.9:8050</value>  
</property>

<property>       
 <name>yarn.resourcemanager.admin.address</name>       
 <value>131.212.41.9:8041</value>  
</property>

<property>       
 <name>yarn.nodemanager.local-dirs</name>       
 <value>/scratch/dfs/yarn</value>  
</property>

<property>       
 <name>yarn.log.dir</name>       
 <value>/scratch/hadoop/yarn/logs</value>  
</property>

</configuration>

如果重要的话我正在运行java-8。

有关如何修复它的任何线索?

1 个答案:

答案 0 :(得分:0)

看起来可能是因为两个原因

可能是已经运行的资源管理器的其他一些实例使用该端口。杀死该资源管理器实例并重新开始。使用命令ps aux | grep -i resourcemanager查找资源管理器的进程ID,然后使用命令kill -9 <RESOURCE_MANAGER_PID>

将其终止

Hadoop并不完全支持JDK-8。有关Hadoop支持的Java版本,请参阅link,如果选项1不起作用,请尝试将Java版本降级为JDK7