I am loading XML data with Pig. When I issue the ILLUSTRATE command, I get an error. Here is what I am doing:
REGISTER piggybank-0.15.0.jar
xml = LOAD '/xml/data/path' using org.apache.pig.piggybank.storage.XMLLoader('doc') as (x:chararray);
ILLUSTRATE xml;
This is the error:
....
.....
2018-03-20 19:56:52,265 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: xml[6,6] C: R:
2018-03-20 19:56:52,266 [main] ERROR org.apache.pig.pen.AugmentBaseDataVisitor - No (valid) input data found!
java.lang.RuntimeException: No (valid) input data found!
at org.apache.pig.pen.AugmentBaseDataVisitor.visit(AugmentBaseDataVisitor.java:585)
at org.apache.pig.newplan.logical.relational.LOLoad.accept(LOLoad.java:230)
at org.apache.pig.pen.util.PreOrderDepthFirstWalker.depthFirst(PreOrderDepthFirstWalker.java:82)
at org.apache.pig.pen.util.PreOrderDepthFirstWalker.walk(PreOrderDepthFirstWalker.java:66)
at org.apache.pig.newplan.PlanVisitor.visit(PlanVisitor.java:52)
at org.apache.pig.pen.ExampleGenerator.getExamples(ExampleGenerator.java:180)
at org.apache.pig.PigServer.getExamples(PigServer.java:1250)
at org.apache.pig.tools.grunt.GruntParser.processIllustrate(GruntParser.java:831)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.Illustrate(PigScriptParser.java:802)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:381)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:198)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:173)
at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:69)
at org.apache.pig.Main.run(Main.java:547)
at org.apache.pig.Main.main(Main.java:158)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
2018-03-20 19:56:52,283 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2997: Encountered IOException. Exception
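I have verified the path to the data and made sure the data is really there. To confirm this, I loaded the same data with PigStorage instead of XMLLoader: ILLUSTRATE works in that case, and DUMP xml shows data in the output. So I suspect XMLLoader is what causes the problem, but I am not sure what exactly goes wrong.
Any help is much appreciated.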
Answer 0 (score: 0)
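This also happens to me when I run ILLUSTRATE directly on the relation produced by the LOAD. Try applying another operation to the loaded relation and run ILLUSTRATE on the result of that instead.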
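For example, something along these lines, continuing from the LOAD statement in the question. This is an untested sketch: the FILTER step and the alias xml_nonempty are only illustrative choices of "another operation", and I can't promise it avoids the error with XMLLoader.
-- xml is the relation loaded in the question; xml_nonempty is a made-up alias
xml_nonempty = FILTER xml BY x IS NOT NULL;
ILLUSTRATE xml_nonempty;
If ILLUSTRATE still reports "No (valid) input data found!", DUMP xml_nonempty should at least show whether XMLLoader returns any records for the 'doc' tag.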