注入运行时异常nutch 2.3

时间:2015-06-17 07:52:09

标签: nutch

我遇到了设置Nutch 2.3和hbase 0.94:

fx@fx:~$ $NUTCH_HOME/runtime/local/bin/nutch inject file:///home/fx/Abivin/apache-nutch-2.3/seed/urls.txt
InjectorJob: starting at 2015-06-17 14:46:35
InjectorJob: Injecting urlDir: file:/home/fx/Abivin/apache-nutch-2.3/seed/urls.txt
InjectorJob: Using class org.apache.gora.memory.store.MemStore as the Gora storage class.
InjectorJob: java.lang.RuntimeException: job failed: name=inject file:/home/fx/Abivin/apache-nutch-2.3/seed/urls.txt, jobid=job_local1999341506_0001
    at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:231)
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:252)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:275)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:284)

当seed / urls.txt存储网址时。我已经搜索了许多类似的错误,但仍然坚持这一点。请给我一些想法来解决。感谢

1 个答案:

答案 0 :(得分:0)

似乎Nutch无法将URL注入“网页”表。首先,请检查gora-hbase中的配置。在配置正确的情况下,您应该删除hbase数据目录并重新开始。

希望这有帮助