如何在Hadoop集群中加载native-hadoop库?

时间:2014-05-24 01:08:24

标签: java r hadoop cloudera-cdh revolution-r

我在Cloudera Hadoop(CDH)4.6版中运行Revolution R Enterprise 7.0版来生成 决策树,使用MapReduce服务。

当我运行Hadoop集群计算上下文时,似乎未加载native-hadoop库。我加载了RevoScaleR软件包。

我已经查看过Revolution Analitics博客,pdf手册和这个论坛,但我还没有找到解决方案。

> rxSetComputeContext(myHadoopCluster)
> BetterTree <- rxDTree(CountryCode ~ IndicatorCode + 1960 + 1961 + 1962 + 1963 + 1964 + 1965 + 1966 + 1967 + 1968 + 1969 + 1970 + 1971 + 1972 + 1973 + 1974 + 1975 + 1976 + 1977 + 1978 + 1979 + 1980 + 1981 + 1982 + 1983 + 1984 + 1985 + 1986 + 1987 + 1988 + 1989 + 1990 + 1991 + 1992 + 1993 + 1994 + 1995 + 1996 + 1997 + 1998 + 1999 + 2000 + 2001 + 2002 + 2003 + 2004 + 2005 + 2006 + 2007 + 2008 + 2009 + 2010 + 2011 + 2012 + 2013, data=salida, blocksPerRead=30, maxUnorderedLevels = 1300,  cp=1e-5)
======  localhost.localdomain (Master HPA Process) has started run at Fri May 23 16:45:42 2014  ======
RxInitializeHadoop sSystemCommand: hadoop RevoScaleR -Dmapred.reduce.tasks=1     -Dmapred.min.split.size=9223372036854775807 /user/RevoShare/cloudera/F8D3BB9C6CDE411CA0F48520656EFE19    /.input /user/RevoShare/cloudera/F8D3BB9C6CDE411CA0F48520656EFE19/IRO.iro /share/better localhost.localdomain 8020 /usr/bin/Revoscript
14/05/23 16:46:40 WARN util.NativeCodeLoader: **Unable to load native-hadoop library for your  platform... using builtin-java classes where applicable**
RxInitializeHadoop after fixup sSystemCommand: hadoop  RevoScaleR  -Dmapred.reduce.tasks=1  -Dmapred.min.split.size=9223372036854775807  /user/RevoShare/cloudera/F8D3BB9C6CDE411CA0F48520656EFE19/.input  /user/RevoShare/cloudera/F8D3BB9C6CDE411CA0F48520656EFE19/IRO.iro  /share/better/*  localhost.localdomain  8020  /usr/bin/Revoscript
Exception in thread "main" java.lang.NoClassDefFoundError: RevoScaleR
Caused by: java.lang.ClassNotFoundException: RevoScaleR
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
Could not find the main class: RevoScaleR.  Program will exit.

HadoopMR output object '/user/RevoShare/cloudera/F8D3BB9C6CDE411CA0F48520656EFE19/IRO.iro' does not exist. Job has failed.
Error in rxCall("RxDTree", params) :
Error:  Error in rxCall("RxDTree", params) :

======  localhost.localdomain (Master HPA Process) has completed run at Fri May 23 16:51:50 2014  ======
Error in rxuHandleClusterJobTryFailure(retObject, hpcServerJob, autoCleanup,  :
Error completing job on cluster:
Error in rxCall("RxDTree", params) :

我在R符号中编写了以下命令以检查RevoScaleR,但未加载。

> library(RevoScaleR)
> is.loaded("RevoScaleR")
[1] FALSE

你能给我一些建议吗?

感谢。

1 个答案:

答案 0 :(得分:0)

你的问题不是原生的hadoop库。这只是一个警告。可能是你的hadoop类路径中没有RevoScaleR mapreduce jar。尝试将该jar复制到你的hadoop lib文件夹中。它应该被命名为scaleR-hadoop-0.1-SNAPSHOT.jar,可能在/usr/lib64/Revo-7.x/hadoop/scripts之下。我不确定路径,因为我无法访问7.0安装。