标签: hadoop dask dask-distributed
以下是我用来连接hdfs和创建dask数据帧的代码。
Client(scheduler_host+":"+scheduler_port) df=dd.read_csv("hdfs://hdfs_host/<path to csv on hdfs>")
错误:
AttributeError: /usr/lib/libhdfs3.so: undefined symbol: hdfsConcat
HADOOP版本:2.5.1