Airflow Druid hook not working

Time: 2017-08-09 16:17:27

Tags: hook airflow druid

I am trying to use the Druid hook to load data from HDFS into Druid. Below is my DAG script:

from datetime import datetime, timedelta
import json
from airflow.hooks import HttpHook, DruidHook
from airflow.operators import PythonOperator
from airflow.models import DAG

def check_druid_con():
    dr_hook = DruidHook(druid_ingest_conn_id='DRUID_INDEX',
                        druid_query_conn_id='DRUID_QUERY')
    dr_hook.load_from_hdfs(
        "druid_airflow",
        "hdfs://xx.xx.xx.xx/demanddata/demand2.tsv",
        "stay_date",
        ["channel", "rate"],
        "2016-12-11/2017-12-13",
        1,
        -1,
        metric_spec=[{"name": "count", "type": "count"}],
        hadoop_dependency_coordinates="org.apache.hadoop:hadoop-client:2.7.3")

default_args = {
    'owner': 'TC',
    'start_date': datetime(2017, 8, 7),
    'retries': 1,
    'retry_delay': timedelta(minutes=5)
}
dag = DAG('druid_data_load', default_args=default_args)
druid_task1 = PythonOperator(task_id='check_druid',
                             python_callable=check_druid_con,
                             dag=dag)

I keep getting the error TypeError: load_from_hdfs() takes at least 10 arguments (10 given). However, I am passing 10 arguments to load_from_hdfs, and it still fails. Please help.

Regards,
Rahul
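A note on how a message like "takes at least 10 arguments (10 given)" can arise: in Python 2, the "given" count includes keyword arguments (and self for a method), while "at least N" counts only the required parameters. The two numbers can therefore match even though some required positional parameters were never supplied. Whether that is what happens here depends on the load_from_hdfs signature in the installed Airflow version, which is not shown in this post. The sketch below is only an illustration: load_stub and its parameters extra_one and extra_two are hypothetical stand-ins, not the real DruidHook API, and missing_required is a helper invented for this example. It is written for Python 3, where inspect.signature is available.

```python
import inspect


def missing_required(func, *args, **kwargs):
    """Return the names of required parameters that this call leaves unfilled."""
    sig = inspect.signature(func)
    # bind_partial accepts an incomplete argument list without raising.
    bound = sig.bind_partial(*args, **kwargs)
    required_kinds = (
        inspect.Parameter.POSITIONAL_ONLY,
        inspect.Parameter.POSITIONAL_OR_KEYWORD,
        inspect.Parameter.KEYWORD_ONLY,
    )
    return [
        name
        for name, param in sig.parameters.items()
        if param.kind in required_kinds
        and param.default is inspect.Parameter.empty
        and name not in bound.arguments
    ]


# Hypothetical stand-in: nine required parameters plus two optional ones.
# extra_one and extra_two are placeholders, NOT real Airflow parameter names.
def load_stub(datasource, static_path, ts_dim, columns, intervals,
              num_shards, target_partition_size, extra_one, extra_two,
              metric_spec=None, hadoop_dependency_coordinates=None):
    pass


# Same call shape as in the question: seven positional values, two keywords.
missing = missing_required(
    load_stub,
    "druid_airflow",
    "hdfs://xx.xx.xx.xx/demanddata/demand2.tsv",
    "stay_date",
    ["channel", "rate"],
    "2016-12-11/2017-12-13",
    1,
    -1,
    metric_spec=[{"name": "count", "type": "count"}],
    hadoop_dependency_coordinates="org.apache.hadoop:hadoop-client:2.7.3",
)
print(missing)
```

Pointing the same helper at the real method (for example missing_required(DruidHook.load_from_hdfs, None, ...) on an interpreter that supports inspect.signature) would list any required parameters the call omits; on Python 2, inspect.getargspec gives the parameter names and defaults instead.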

0 answers:

No answers yet