注射器后,荷兰爬行停止。

时间:2014-05-15 11:30:24

标签: apache web generator nutch web-crawler

这是我的cygwin屏幕看起来......

cygpath: can't convert empty path
Injector: starting at 2014-05-15 16:57:50
Injector: crawlDb: -dir/crawldb
Injector: urlDir: urls
Injector: Converting injected urls to crawl db entries.
Patch for HADOOP-7682: Instantiating workaround file system
Injector: total number of urls rejected by filters: 1
Injector: total number of urls injected after normalization and filtering: 0
Injector: Merging injected urls into crawl db.
Injector: overwrite: false
Injector: update: false
Injector: finished at 2014-05-15 16:57:52, elapsed: 00:00:02

1 个答案:

答案 0 :(得分:0)

注入的网址总数为0.这无法抓取。

Injector: total number of urls rejected by filters: 1
Injector: total number of urls injected after normalization and filtering: 0