Counting filtered values - Apache Pig

Time: 2016-09-16 14:21:19

Tags: hadoop apache-pig hadoop2

I have the following statement:

Values = FILTER Input_Data BY Fields > 0;

How can I count the number of records that were filtered out, and the number that were not?

1 answer:

Answer 0: (score: 1)

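-- split into 2 datasets (alias and field names match the question's statement)
-- note: records where Fields is null match neither condition and go to neither A nor B
SPLIT Input_Data INTO A IF Fields > 0, B IF Fields <= 0;

-- count > 0 records (the ones the FILTER keeps)
A_grp = GROUP A ALL;
A_count = FOREACH A_grp GENERATE COUNT(A);

-- count <= 0 records (the ones the FILTER drops)
B_grp = GROUP B ALL;
B_count = FOREACH B_grp GENERATE COUNT(B);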
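
As an alternative, both counts can probably be produced with a single GROUP instead of two. The following is only a sketch under assumptions: it reuses `Input_Data` and `Fields` from the question, while the aliases `flagged`, `grp`, `counts` and the field names `kept`, `kept_count`, `filtered_count` are made up for illustration. It marks each record with 1 if the FILTER would keep it, then derives the dropped count as the total minus the kept count.

```pig
-- Sketch only: flagged, grp, counts, kept, kept_count and filtered_count
-- are hypothetical names, not part of the original question or answer.

-- 1 if the record would pass "Fields > 0", 0 if not (null when Fields is null)
flagged = FOREACH Input_Data GENERATE (Fields > 0 ? 1 : 0) AS kept;

-- put every record into one group so the aggregates see the whole relation
grp = GROUP flagged ALL;

-- SUM ignores nulls; COUNT_STAR counts every record, including null ones
counts = FOREACH grp GENERATE
    SUM(flagged.kept) AS kept_count,
    COUNT_STAR(flagged) - SUM(flagged.kept) AS filtered_count;

DUMP counts;
```

Unlike the SPLIT version, records with a null `Fields` are counted as filtered out here, which matches what the FILTER statement does with them. If every `Fields` value is null, `SUM` returns null and both outputs would be null, so that case would need separate handling.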

Hope this helps!