在Pig中使用streaming_python时出错

时间:2014-04-29 18:04:56

标签: python apache-pig

当我运行以下内容时:

REGISTER /home/hduser/Documents/ccc/Research/phd/code/ECentre/scripts/bags.py USING streaming_python 
AS bp;

raw = LOAD 'hdfs:///user/hduser/smsCorpus_en_2012.04.30_all.xml'   AS (line:chararray);


b = foreach raw generate bp.enumerate_bag(line);

我得到了

Failed to parse: Pig script failed to parse: 
<file /home/hduser/Documents/ccc/Research/phd/code/ECentre/scripts/nltk.pig, line 13, column
25> Failed to generate logical plan. Nested exception: org.apache.pig.backend.executionengine.ExecException:
ERROR 1070: Could not resolve bp.enumerate_bag using imports: [, java.lang., org.apache.pig.builtin.,
org.apache.pig.impl.builtin.]

bags.py:

#!/usr/bin/env python 
def enumerate_bag(input):
    output = []
    for rank, item in enumerate(input):
        output.append(tuple([rank] + list(item)))
    return output

谁能告诉我为什么?

我的版本是: Apache Pig版本0.12.2-SNAPSHOT(r:未知) 编译于2014年4月29日,13:40:45

0 个答案:

没有答案