sqoop导入数据到hive抛出ERROR org.apache.sqoop.hive.HiveConfig?

时间:2016-10-31 04:06:47

标签: sqoop

  1. 我在Ambari HDP 2.5.0上安装了HUE 3.10

  2. 完全配置hue.ini

  3. 我的问题是从sqql到hive的var sqoop同步数据,它抛出异常:

    [main] ERROR org.apache.sqoop.hive.HiveConfig – Could not load org.apache.hadoop.hive.conf.HiveConf. Make sure HIVE_CONF_DIR is set correctly. 
    
    [main] ERROR org.apache.sqoop.hive.HiveConfig – Could not load org.apache.hadoop.hive.conf.HiveConf. Make sure HIVE_CONF_DIR is set correctly. 
    
    [main] ERROR org.apache.sqoop.tool.ImportTool – Encountered IOException running import job: java.io.IOException: java.lang.ClassNotFoundException: org.apache.hadoop.hive.conf.HiveConf
    
    at org.apache.sqoop.hive.HiveConfig.getHiveConf(HiveConfig.java:50)
        at org.apache.sqoop.hive.HiveImport.getHiveArgs(HiveImport.java:397)
        at org.apache.sqoop.hive.HiveImport.executeExternalHiveScript(HiveImport.java:384)
        at org.apache.sqoop.hive.HiveImport.executeScript(HiveImport.java:342)
        at org.apache.sqoop.hive.HiveImport.importTable(HiveImport.java:246)
        at org.apache.sqoop.tool.ImportTool.importTable(ImportTool.java:524)
        at org.apache.sqoop.tool.ImportTool.run(ImportTool.java:615)
        at org.apache.sqoop.Sqoop.run(Sqoop.java:147)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:76)
        at org.apache.sqoop.Sqoop.runSqoop(Sqoop.java:183)
        at org.apache.sqoop.Sqoop.runTool(Sqoop.java:225)
        at org.apache.sqoop.Sqoop.runTool(Sqoop.java:234)
        at org.apache.sqoop.Sqoop.main(Sqoop.java:243)
        at org.apache.oozie.action.hadoop.SqoopMain.runSqoopJob(SqoopMain.java:202)
        at org.apache.oozie.action.hadoop.SqoopMain.run(SqoopMain.java:182)
        at org.apache.oozie.action.hadoop.LauncherMain.run(LauncherMain.java:51)
        at org.apache.oozie.action.hadoop.SqoopMain.main(SqoopMain.java:48)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at org.apache.oozie.action.hadoop.LauncherMapper.map(LauncherMapper.java:242)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)
        at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1724)
        at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)
    

    但是,如果在命令行中执行相同的sqoop脚本,它可以工作!

    添加了环境变量HADOOP_CLASSPATH=$HADOOP_CLASSPATH:/usr/hdp/current/hive-client/lib to /etc/profile。它仍然无效。我试过一些时间自己解决这个问题,但是发誓。

    脚本为/usr/hdp/2.5.0.0-1245/hive/bin/hive。看起来像${HADOOP_CLASSPATH} point to /usr/hdp/2.5.0.0-1245/atlas/hook/hive/* ?

        #!/bin/bash
    
    
        if [ -d "/usr/hdp/2.5.0.0-1245/atlas/hook/hive" ]; then
          if [ -z "${HADOOP_CLASSPATH}" ]; then
            export HADOOP_CLASSPATH=/usr/hdp/2.5.0.0-1245/atlas/hook/hive/*
          else
            export HADOOP_CLASSPATH=${HADOOP_CLASSPATH}:/usr/hdp/2.5.0.0-1245/atlas/hook/hive/*
          fi
        fi
    
    
        BIGTOP_DEFAULTS_DIR=${BIGTOP_DEFAULTS_DIR-/etc/default}
        [ -n "${BIGTOP_DEFAULTS_DIR}" -a -r ${BIGTOP_DEFAULTS_DIR}/hbase ] && . ${BIGTOP_DEFAULTS_DIR}/hbase
    
    
    
    
        export HIVE_HOME=${HIVE_HOME:-/usr/hdp/2.5.0.0-1245/hive}
        export HADOOP_HOME=${HADOOP_HOME:-/usr/hdp/2.5.0.0-1245/hadoop}
        export ATLAS_HOME=${ATLAS_HOME:-/usr/hdp/2.5.0.0-1245/atlas}
    
    
        HCATALOG_JAR_PATH=/usr/hdp/2.5.0.0-1245/hive-hcatalog/share/hcatalog/hive-hcatalog-core-1.2.1000.2.5.0.0-1245.jar:/usr/hdp/2.5.0.0-1245/hive-hcatalog/share/hcatalog/hive-hcatalog-server-extensions-1.2.1000.2.5.0.0-1245.jar:/usr/hdp/2.5.0.0-1245/hive-hcatalog/share/webhcat/java-client/hive-webhcat-java-client-1.2.1000.2.5.0.0-1245.jar
    
    
        if [ -z "${HADOOP_CLASSPATH}" ]; then
          export HADOOP_CLASSPATH=${HCATALOG_JAR_PATH}
        else
          export HADOOP_CLASSPATH=${HADOOP_CLASSPATH}:${HCATALOG_JAR_PATH}
        fi
    
    
        exec "${HIVE_HOME}/bin/hive.distro" "$@"
    

    如何解决这个问题?

1 个答案:

答案 0 :(得分:0)

对我来说,这个问题在Ambari工作流程编辑器中显示出来。要解决此问题,请在每个sqoop客户端节点中创建一个符号链接,以便在hive-exec.jar所在的位置配置lib。 接下来,将hive-exec.jar放入HDFS oozie共享库文件夹中。

su root

cd /usr/hdp/current/sqoop-client/

ln -s /usr/hdp/current/hive-client/lib/hive-exec.jar hive-exec.jar

cp hive-exec.jar lib/

su -l hdfs 

hdfs dfs -put hive-exec.jar /user/oozie/share/lib/sqoop

hdfs dfs -put hive-exec.jar /user/oozie/share/lib/lib_20161117191926/sqoop