SolrIndexerJob:java.lang.RuntimeException:作业失败:

时间:2016-05-23 19:22:11

标签: apache nutch

Apache nutch \“crawl”脚本在索引到solr时失败。有没有想过这个?

IndexingJob: starting
SolrIndexerJob: java.lang.RuntimeException: job failed: name=[myId_1]Indexer, jobid=job_local483340309_0001
        at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:120)
        at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:154)
        at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:176)
        at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:202)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:211)

2 个答案:

答案 0 :(得分:1)

此消息信息量不大,检查/发布Solr日志是否存在任何相关错误。造成这种情况的常见原因之一是Nutch和Solr模式之间的不匹配。

答案 1 :(得分:0)

1)您必须配置gora.properties,

2)无论你拥有Gora(Mongo,HBase,Cassandra等等)背后的任何东西都没有响应,所以nutch需要" waitForCompletion",所以要确保它已启动并运行

确保你使用kill -9和旧的java nutch进程杀死旧的已失效进程,如果你找不到它们就重新启动(希望它不会成功......)