猪:解析失败:输入不匹配' id'期待设置为null

时间:2014-06-28 00:16:51

标签: hadoop mapreduce apache-pig

我正在使用Pig 0.12.1并拥有以下Pig代码:

C = LOAD '$file' USING myCustomLoader();
D = FOREACH C GENERATE key#id;

我正在使用自定义加载程序加载文件。然后我想生成存储在key地图中的所有ID。

为什么我收到以下错误消息:

14/06/27 16:56:21 ERROR pig.PigServer: exception during parsing: Error during parsing.     <line 3, column 28>  mismatched input 'id' expecting set null
Failed to parse: <line 3, column 28>  mismatched input 'id' expecting set null

这是完整的堆栈跟踪:

14/06/27 16:56:21 ERROR pig.PigServer: exception during parsing: Error during parsing.     <line 3, column 28>  mismatched input 'id' expecting set null
Failed to parse: <line 3, column 28>  mismatched input 'id' expecting set null
        at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:241)
        at org.apache.pig.parser.QueryParserDriver.parse(QueryParserDriver.java:179)
        at org.apache.pig.PigServer$Graph.parseQuery(PigServer.java:1676)
        at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1623)
        at org.apache.pig.PigServer.registerQuery(PigServer.java:575)
        at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:1093)
        at org.apache.pig.pigunit.pig.GruntParser.processPig(GruntParser.java:61)
        at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:501)
        at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:198)
        at org.apache.pig.pigunit.pig.PigServer.registerScript(PigServer.java:56)
        at org.apache.pig.pigunit.PigTest.registerScript(PigTest.java:170)
        at org.apache.pig.pigunit.PigTest.assertOutput(PigTest.java:249)
        at com.testpig.PigTest.testPig(PigTest.java:159)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
        at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
        at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
        at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
        at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
        at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
        at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
        at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
        at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
        at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
        at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
        at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
        at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
        at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
        at org.apache.maven.surefire.junit4.JUnit4TestSet.execute(JUnit4TestSet.java:53)
        at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:123)
        at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:104)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:601)
        at org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray(ReflectionUtils.java:164)
        at org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:110)
        at org.apache.maven.surefire.booter.SurefireStarter.invokeProvider(SurefireStarter.java:172)
        at org.apache.maven.surefire.booter.SurefireStarter.runSuitesInProcessWhenForked(SurefireStarter.java:104)
        at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:70)
14/06/27 16:56:21 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
14/06/27 16:56:21 INFO compress.CodecPool: Got brand-new compressor

1 个答案:

答案 0 :(得分:4)

问题中的错误代码是:

C = LOAD '$file' USING myCustomLoader();
D = FOREACH C GENERATE key#id;

正确的代码是:

C = LOAD '$file' USING myCustomLoader();
D = FOREACH C GENERATE key#'id';

在Pig中,要访问地图的密钥,您必须在密钥周围使用单引号。

有关更多信息,请参阅Philip(翻转)Kromer在https://www.mail-archive.com/dev@pig.apache.org/msg24691.html的帖子:

Omitting the quotes on the key dereference gives a very unhelpful error message.

{code}
users = FOREACH user_hashes GENERATE info#userid AS userid:chararray;

-- 400   ERROR: ERROR 1200: <file ./foo.pig, line 8, column 42>  [...] 
mismatched input 'userid' expecting set null
{code}

It may be that the user forgot the quotes, or may instead be assuming that Pig 
allows dereferencing a map by the value of an alias or expression:

{code}
users = FOREACH user_hashes GENERATE
  info#'username',               -- works
  info#username,                 -- need quotes around literal
  info#fullref,                  -- no, can't use an alias' value to deref
  info#(CONCAT('user',shortref)) -- and can't use an expression to deref
  ;
{code}