我正试图将种子注入Nutch。
我使用的命令:
bin/nutch inject /root/project/nutch-old/runtime/local/conf/urls/
结果:
InjectorJob: starting at 2017-01-06 05:29:21
InjectorJob: Injecting urlDir: /root/project/nutch-old/runtime/local/conf/urls
InjectorJob: Using class org.apache.gora.accumulo.store.AccumuloStore as the Gora storage class.
InjectorJob: java.lang.RuntimeException: job failed: name=apache-nutch-2.4-SNAPSHOT.jar, jobid=job_local798287578_0001
at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:120)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:247)
at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:268)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:291)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:300)
答案 0 :(得分:0)
您还需要提及种子URL的文件名,您可能已将其保存在urls目录下。你的命令就像bin/nutch inject urls/seed.txt
如果有效,请试试让我知道。