Runtime exception when injecting with Nutch 2.4

Time: 2017-01-06 19:47:10

Tags: java nutch accumulo

I am trying to inject seed URLs into Nutch.

The command I used:

bin/nutch inject /root/project/nutch-old/runtime/local/conf/urls/

Result:

InjectorJob: starting at 2017-01-06 05:29:21
InjectorJob: Injecting urlDir: /root/project/nutch-old/runtime/local/conf/urls
InjectorJob: Using class org.apache.gora.accumulo.store.AccumuloStore as the Gora storage class.
InjectorJob: java.lang.RuntimeException: job failed: name=apache-nutch-2.4-SNAPSHOT.jar, jobid=job_local798287578_0001
    at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:120)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:247)
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:268)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:291)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:300)

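1 answer:

Answer 0: (score: 0)

You also need to give the name of the seed URL file, which you have probably saved under the urls directory. Your command would look like bin/nutch inject urls/seed.txt. Please try it and let me know if it works.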
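The suggestion above can be sketched as the following shell steps. This is only an illustration of what the answer proposes, not a verified fix for this build: the file name seed.txt comes from the answer, while the seed URL and the relative urls directory are assumptions chosen for the example, and the commands are meant to be run from the Nutch runtime/local directory.

```shell
# Create a seed file with one URL per line.
# The URL below is a placeholder (assumption); replace it with your own seeds.
mkdir -p urls
printf '%s\n' 'https://nutch.apache.org/' > urls/seed.txt

# Run the inject command in the form suggested by the answer.
# The guard only avoids a confusing error when run outside runtime/local.
if [ -x bin/nutch ]; then
  bin/nutch inject urls/seed.txt
else
  echo "bin/nutch not found; run this from the Nutch runtime/local directory" >&2
fi
```

If the job still fails, the stack trace shown in the question only reports that the job failed, so the underlying cause has to be looked up elsewhere.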