Pig Dump命令抛出错误

时间:2014-11-15 15:39:46

标签: hadoop apache-pig

无法从连接中获取数据。

数据: 豪尔赫波萨达|洋基队| {(捕手,2000年),(Designated_hitter,2001年)} | [游戏#1594,#hit_by_pitch 65,grand_slams#7] Landon Powell |奥克兰| {(Catcher,2000),(First_baseman,2001)} | [on_base_percentage#0.297,游戏#26,home_runs#7] 马丁普拉多|亚特兰大| {(Second_baseman,2002年),(内野手,2003年),(Left_fielder,2001年)} | [游戏#258,hit_by_pitch#3]

**Code:**
bfile= LOAD 'basketball1.txt' using PigStorage('|') as (name:chararray,team:chararray,pos:bag{t:tuple(point:chararray,year:int)},bat:map[]);
bfile1= foreach bfile generate name,pos.year as year;
bfile2= foreach bfile1 generate name,flatten(year) as play_year ;
bfile3= group bfile2 by play_year;
bfile4= foreach bfile3 generate group,COUNT($1) as count;
bfile5= foreach bfile generate flatten(pos.year) as year,bat#'games' as games_cnt;
bfile6= group bfile5 by year;
bjoin= join bfile3 by group ,bfile6 by group;
bjoin1= foreach bjoin generate bfile3.group,bfile3::bfile2.name as name,
bfile6::bfile5.games_cnt as tot_games;
**Describe bjoin1:**
bjoin: {bfile3::group: int,bfile3::bfile2: {(name: chararray,play_year: int)},
bfile6::group: int,bfile6::bfile5: {(year: int,games_cnt: bytearray)}}

在执行转储bjoin1 时,我遇到以下问题:

2014-11-15 07:31:42,318 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Some jobs have failed! Stop running all dependent jobs
2014-11-15 07:31:42,321 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias bjoin1
Details at logfile: /home/cloudera/pig_1416065344409.log
grunt> 2014-11-15 07:31:47,857 [communication thread] INFO  org.apache.hadoop.mapred.LocalJobRunner - reduce > reduce

0 个答案:

没有答案