LXC容器中的Hadoop错误:YARN:1/1 local-dirs很糟糕

时间:2015-03-09 19:05:57

标签: ubuntu hadoop yarn lxc

我在同一主机上的两个lxc容器内运行了两个hadoop实例,一个hadoop-master和一个hadoop-slave1。 在开始YARN&主人的DFS我为hadoop-slave1获得了这个UNHEALTHY状态。

对于我在网上发现的内容,它必须是以下两种可能之一:

  • 磁盘空间不足。
  • 许可问题

一个。 df -h另有说法:

Filesystem      Size  Used Avail Use% Mounted on
/dev/sda5        91G   68G   19G  79% /
none            4,0K     0  4,0K   0% /sys/fs/cgroup
udev            3,8G  4,0K  3,8G   1% /dev
tmpfs           769M  1,3M  768M   1% /run
none            5,0M     0  5,0M   0% /run/lock
none            3,8G  536K  3,8G   1% /run/shm
none            100M   48K  100M   1% /run/user

湾  ll /usr/local/hadoop

............
drwxr-xr-x  2 hduser hadoop  4096 Mar  8 19:10 local/

ll /usr/local/hadoop/logs

total 132
drwxr-xr-x  3 hduser hadoop  4096 Mar  8 18:55 ./
drwxr-xr-x 12 hduser hadoop  4096 Mar  8 18:54 ../
-rw-r--r--  1 hduser hadoop 46222 Mar  8 18:55 hadoop-hduser-datanode-hadoop-slave1.log
-rw-r--r--  1 hduser hadoop   718 Mar  8 18:55 hadoop-hduser-datanode-hadoop-slave1.out
-rw-r--r--  1 hduser hadoop     0 Mar  8 17:08 SecurityAuth-hduser.audit
drwxr-xr-x  2 hduser hadoop  4096 Mar  8 19:10 userlogs/
-rw-r--r--  1 hduser hadoop 56645 Mar  8 19:08 yarn-hduser-nodemanager-hadoop-slave1.log
-rw-r--r--  1 hduser hadoop   702 Mar  8 18:56 yarn-hduser-nodemanager-hadoop-slave1.out

my yarn-site.xml:

<configuration>
        <property>
                <name>yarn.nodemanager.aux-services</name>
                <value>mapreduce_shuffle</value>
        </property>
        <property>
                <name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
                <value> org.apache.hadoop.mapred.ShuffleHandler</value>
        </property>
        <property>
                <name>yarn.resourcemanager.resource-tracker.address</name>
                <value>hadoop-master:8025</value>
        </property>
        <property>
                <name>yarn.resourcemanager.scheduler.address</name>
                <value>hadoop-master:8030</value>
        </property>
        <property>
                <name>yarn.resourcemanager.address</name>
                <value>hadoop-master:8050</value>
        </property>
        <property>
                <name>yarn.nodemanager.local-dirs</name>
                <value>file:///usr/local/hadoop/local</value>
        </property>
</configuration>

ResourceManager抱怨的错误:

INFO org.apache.hadoop.yarn.server.resourcemanager.rmnode.RMNodeImpl: Node hadoop-slave1:48673 reported UNHEALTHY with details: 1/1 local-dirs are bad: /usr/local/hadoop/local; 1/1 log-dirs are bad: /usr/local/hadoop/logs/userlogs

他们说here我需要制作一些磁盘空间,lxc-container的磁盘空间是否与主机操作系统相同? 有任何想法吗 ? 感谢

编辑:

我在NodeManager错误日志中完全忽略了这一点:

2015-03-10 08:15:17,671 WARN org.apache.hadoop.yarn.server.nodemanager.DirectoryCollection: Directory /usr/local/hadoop/local error, Directory is not executable: /usr
/local/hadoop/local, removing from list of valid directories
2015-03-10 08:15:17,671 WARN org.apache.hadoop.yarn.server.nodemanager.DirectoryCollection: Directory /usr/local/hadoop/logs/userlogs error, Directory is not executab
le: /usr/local/hadoop/logs/userlogs, removing from list of valid directories
2015-03-10 08:15:17,671 INFO org.apache.hadoop.yarn.server.nodemanager.LocalDirsHandlerService: Disk(s) failed: 1/1 local-dirs are bad: /usr/local/hadoop/local; 1/1 l
og-dirs are bad: /usr/local/hadoop/logs/userlogs
2015-03-10 08:15:17,671 ERROR org.apache.hadoop.yarn.server.nodemanager.LocalDirsHandlerService: Most of the disks failed. 1/1 local-dirs are bad: /usr/local/hadoop/l
ocal; 1/1 log-dirs are bad: /usr/local/hadoop/logs/userlogs

0 个答案:

没有答案