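I need to know whether it is mandatory to use FOREACH in Apache Pig before doing any relational transformation. Can you help me understand which of the approaches below is better and would help improve performance? The files are large.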
Approach1:
FOREACH
Approach2:
A = LOAD 'input1' USING PigStorage(',') AS (id:int, name:chararray);
B = LOAD 'input2' USING PigStorage(',') AS (id:int, dept:int, dname:chararray);
C = JOIN A by id, B by id;