我正在使用Apache Nutch 1.13进行爬网。在解析步骤期间,我收到此错误。我无法生成导致此错误的网址
java.lang.Exception: java.lang.NoSuchMethodError: org.apache.commons.compress.compressors.CompressorStreamFactory.<init>(Z)V
at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:522)
Caused by: java.lang.NoSuchMethodError: org.apache.commons.compress.compressors.CompressorStreamFactory.<init>(Z)V
at org.apache.tika.parser.pkg.CompressorParser.parse(CompressorParser.java:120)
at org.apache.nutch.parse.tika.TikaParser.getParse(TikaParser.java:134)
at org.apache.nutch.parse.ParseUtil.parse(ParseUtil.java:107)
at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:109)
at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:46)
at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)
at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:243)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
我跟踪了日志,但无法理解问题。任何帮助将不胜感激!