Nutch 2.2 generator fails with a RuntimeException during the crawl job

Time: 2016-08-18 15:44:16

Tags: java solr nutch runtimeexception solrcloud

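I am getting the following error during the crawl, but only some of the time. When I check each day, a few collections have failed, with no pattern. I have a number of collections (en, es, it, and so on); today the "en" collection may fail, but tomorrow "en" will crawl successfully. I have not been able to identify any pattern or cause. I checked inodes and disk space and found no problems.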

Nutch: 2.2.1
Java: 1.7
HBase: 0.90.4

Yesterday, for en:

InjectorJob: java.lang.RuntimeException: job failed: name=[crawl_en_03_live]inject /software/bea/nutch/live/en_03/seed_03.txt, jobid=job_local406356849_0001
    at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233)
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)

On another day, for a different language (pt in the log below):

InjectorJob: Injecting urlDir: /software/bea/nutch/live/pt/seed.txt
InjectorJob: Using class org.apache.gora.hbase.store.HBaseStore as the Gora storage class.
InjectorJob: java.lang.RuntimeException: job failed: name=[crawl_pt_live]inject /software/bea/nutch/live/pt/seed.txt, jobid=job_local1327090171_0001
    at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233)
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)

Any help is appreciated.

0 answers:

No answers