Azure oozie工作流程

时间:2016-10-03 12:49:24

标签: azure hadoop sqoop oozie hdinsight

我正在尝试在azurehdinsight集群上运行oozie工作流,作业定义如下所示:

<workflow-app xmlns="uri:oozie:workflow:0.2" name="oozie-sqoop">
  <start to="sqoop1" />
  <action name="sqoop1">
    <sqoop xmlns="uri:oozie:sqoop-action:0.4">
      <job-tracker>jobtrackerhost:9010</job-tracker>
      <name-node>wasb://abc@def.blob.core.windows.net</name-node>
      <configuration>
        <property>
          <name>mapred.job.queue.name</name>
          <value>default</value>
        </property>
      </configuration>
      <arg>import</arg>
      <arg>--connect</arg>
      <arg>jdbc:mysql://{ip}/svnadmin</arg>
      <arg>--username</arg>
      <arg>uname</arg>
      <arg>--password</arg>
      <arg>password</arg>
      <arg>--table</arg>
      <arg>rights</arg>
      <arg>--hive-import</arg>
    </sqoop>
    <ok to="end" />
    <error to="fail" />
  </action>
  <kill name="fail">
    <message>sqoop failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message>
  </kill>
  <end name="end" />
</workflow-app>

日志:

2016-10-04 06:16:06,816  INFO ActionStartXCommand:520 - SERVER[hn0-saint.3oitbdwtly0uzabcmledackovts0a.bx.internal.cloudapp.net] USER[saint] GROUP[-] TOKEN[] APP[oozie-sqoop] JOB[0000015-160928235712742-oozie-oozi-W] ACTION[0000015-160928235712742-oozie-oozi-W@:start:] Start action [0000015-160928235712742-oozie-oozi-W@:start:] with user-retry state : userRetryCount [0], userRetryMax [0], userRetryInterval [10]
2016-10-04 06:16:06,827  INFO ActionStartXCommand:520 - SERVER[hn0-saint.3oitbdwtly0uzmledackovts0a.bx.internal.cloudapp.net] USER[saint] GROUP[-] TOKEN[] APP[oozie-sqoop] JOB[0000015-160928235712742-oozie-oozi-W] ACTION[0000015-160928235712742-oozie-oozi-W@:start:] [***0000015-160928235712742-oozie-oozi-W@:start:***]Action status=DONE
2016-10-04 06:16:06,828  INFO ActionStartXCommand:520 - SERVER[hn0-saint.3oitbdwtly0uzmleklasackovts0a.bx.internal.cloudapp.net] USER[saint] GROUP[-] TOKEN[] APP[oozie-sqoop] JOB[0000015-160928235712742-oozie-oozi-W] ACTION[0000015-160928235712742-oozie-oozi-W@:start:] [***0000015-160928235712742-oozie-oozi-W@:start:***]Action updated in DB!
2016-10-04 06:16:07,508  INFO WorkflowNotificationXCommand:520 - SERVER[hn0-saint.3oitbdxbtly0uzmledackovts0a.bx.internal.cloudapp.net] USER[-] GROUP[-] TOKEN[-] APP[-] JOB[0000015-160928235712742-oozie-oozi-W] ACTION[] No Notification URL is defined. Therefore nothing to notify for job 0000015-160928235712742-oozie-oozi-W

但它没有向hive导入任何内容。当我尝试使用sqoop命令时,它成功并成功导入到配置单元。 它总是显示状态为正在运行,它永远不会改变为任何东西

1 个答案:

答案 0 :(得分:1)

看起来您的WASB路径错误,不确定您是否将其更改为发布目的。

我相信它应该如下。

<name-node>wasbs://abc@def.blob.core.windows.net</name-node>

您缺少 s ,请检查。