改变猪模式

时间:2016-04-08 09:39:36

标签: apache-pig

我正在尝试更改架构:

describe JOINED_WITH_PARENT;
JOINED_WITH_PARENT: {ROWS_TUPLES::id: chararray,ROWS_TUPLES::url: chararray,ROWS_TUPLES::score: chararray,ROWS_TUPLES::parentId: chararray,ROWS_TUPLES::title: chararray,ROWS_TUPLES::body: chararray,GROUPED_PARENT_ID::group: chararray,GROUPED_PARENT_ID::ROWS_TUPLES: {(id: chararray,url: chararray,score: chararray,parentId: chararray,title: chararray,body: chararray)}}

要:

JOINED_WITH_PARENT: {id: chararray,url: chararray,:score: parentId: chararray,title: chararray,body: chararray,group: chararray, answers: {(id: chararray,url: chararray,score: chararray,parentId: chararray,title: chararray,body: chararray)}}

我正在考虑在JOINED_WITH_PARENT上做一个foreach并尝试使用以下内容,但语法似乎不正确。我是在正确的轨道上吗?任何帮助将不胜感激

A = FOREACH JOINED_WITH_PARENT GENERATE 
   id as (id:chararray),
   score as (score:chararray),
   parentId as (parentId:chararray),
   title as (title:chararray),
   body as (body:chararray),
   group as (group:chararray),
   answers as tuple(chararray,chararray,chararray,chararray,chararray)GROUPED_PARENT_ID::ROWS_TUPLES;

1 个答案:

答案 0 :(得分:0)

如果你只是想要设置字段,你不应该指定类型,当你从连接项目中进行项目时需要指定“源”关系,如下所示:

A = FOREACH JOINED_WITH_PARENT GENERATE 
   ROWS_TUPLES::id as id,
   ROWS_TUPLES::score as score,
   ...
   GROUPED_PARENT_ID::ROWS_TUPLES as answers;