如何为python ibis或请求设置impala namenode rpc端口号

时间:2016-04-08 19:25:30

标签: python-requests port impala ibis

我正在使用&ibis-framework'。我有

hdfs_client = ibis.hdfs_connect(...)
impala_client = ibis.impala.connect(..., hdfs_client=hdfs_client)
db = impala_client.database('abc')
data = pd.DataFrame(...)
db.create_table('tb_name', obj=data, format='parquet', force=True)

此失败是因为namenoderpcaddress(在ibis中调用)构造的requests具有端口8020,而对我来说正确的是8022(这是Cloudera推荐的端口;可能用于HA目的)。 / p>

ConnectionError: HTTPConnectionPool(host='ip-0-0-0-0.ec2.internal', port=50075): Max retries exceeded with url: /webhdfs/v1/tmp/ibis/pandas_7ae170c27ee6426e97e0f84aa9a2a778/0.csv?op=CREATE&user.name=user&namenoderpcaddress=ip-0-0-0-0.ec2.internal:8020&overwrite=false&user.name=user (Caused by NewConnectionError('<requests.packages.urllib3.connection.HTTPConnection object at 0x7feb44be49d0>: Failed to establish a new connection: [Errno -2] Name or service not known',))

(上面的msg中的ip地址由我编辑)

我知道如何判断ibis或请求此端口号吗?

感谢。

1 个答案:

答案 0 :(得分:0)

ibis.hdfs_connect会使用port参数来传递8022