Installing pydoop 1.0.0-rc1 on CDH 5.3.0

Time: 2015-02-04 14:46:19

Tags: python hadoop

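I am trying to upgrade to pydoop 1.0.0-rc1 on a Linux server running CDH 5.3.0. I was able to complete the installation in another environment running CDH 5.1.2. I run the command "pip install --proxy .... --upgrade pydoop==1.0.0-rc1"; it downloads the package and completes the C++ compilation. When it reaches the Java build, it fails: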

running build_scripts
creating build/scripts-2.7
copying and adjusting scripts/pydoop -> build/scripts-2.7
changing mode of build/scripts-2.7/pydoop from 644 to 755
hadoop_home: '/opt/cloudera/parcels/CDH-5.3.0-1.cdh5.3.0.p0.30/jars'
hadoop_version: '2.5.0-cdh5.3.0'
java_home: '/usr/java/latest'
Building java code for hadoop-2.5.0-cdh5.3.0
Compiling Java classes
src/v1/it/crs4/pydoop/NoSeparatorTextOutputFormat.java:31: cannot find symbol
symbol  : class JobConf
location: package org.apache.hadoop.mapred
import org.apache.hadoop.mapred.JobConf;
                               ^
src/v1/it/crs4/pydoop/NoSeparatorTextOutputFormat.java:32: cannot find symbol
symbol  : class RecordWriter
location: package org.apache.hadoop.mapred
import org.apache.hadoop.mapred.RecordWriter;
                               ^
src/v1/it/crs4/pydoop/NoSeparatorTextOutputFormat.java:33: cannot find symbol

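According to the log, the build has the location of the Hadoop libraries, so I don't know what to try next.

Does this version of pydoop support CDH 5.3.0? Any suggestions on how to resolve this?

Thanks for any advice.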

Louis

1 answer:

Answer 0: (score: 1)

I'm a Pydoop developer. I tested the latest pydoop release (1.0.0) on CDH 5.3.0 and it worked. Probably the best thing to do is to remove the previous installation and rerun: sudo pip install pydoop
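
If the reinstall still fails with the same "cannot find symbol" errors, it may help to check whether the directory reported as hadoop_home actually contains a JAR with the classes the compiler is missing. The following is only a diagnostic sketch, not part of Pydoop: the function name jars_containing is made up for illustration, it assumes Python 3, and whether a missing JAR is really the cause of the failure above is an assumption.

```python
import os
import sys
import zipfile


def jars_containing(jar_dir, class_name):
    """Return the sorted names of JAR files in jar_dir that contain class_name.

    Hypothetical helper for illustration; class_name is a fully qualified
    Java class name such as 'org.apache.hadoop.mapred.JobConf'.
    """
    # Inside a JAR, a class is stored as a path with '/' separators.
    entry = class_name.replace(".", "/") + ".class"
    hits = []
    for name in sorted(os.listdir(jar_dir)):
        if not name.endswith(".jar"):
            continue
        path = os.path.join(jar_dir, name)
        try:
            with zipfile.ZipFile(path) as jar:
                if entry in jar.namelist():
                    hits.append(name)
        except (zipfile.BadZipFile, OSError):
            # Skip unreadable or corrupt archives (e.g. dangling symlinks).
            continue
    return hits


if __name__ == "__main__":
    # Pass the hadoop_home path printed in the build log as the first argument.
    jar_dir = sys.argv[1] if len(sys.argv) > 1 else "."
    for cls in ("org.apache.hadoop.mapred.JobConf",
                "org.apache.hadoop.mapred.RecordWriter"):
        found = jars_containing(jar_dir, cls)
        print(cls, "->", ", ".join(found) if found else "not found")
```

Run it with the hadoop_home value from the log as the argument. If both classes are reported as "not found", the build is probably looking at a directory that lacks the MapReduce client JARs, which would be consistent with the compiler errors; if they are found, the cause lies elsewhere.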