What do the Scrapy log messages mean?

Time: 2016-01-07 06:14:39

Tags: python scrapy

For example:

2016-01-07 11:37:19 [scrapy] INFO: Crawled 61 pages (at 61 pages/min), scraped 0 items (at 0 items/min)
2016-01-07 11:38:19 [scrapy] INFO: Crawled 171 pages (at 110 pages/min), scraped 0 items (at 0 items/min)
2016-01-07 11:39:19 [scrapy] INFO: Crawled 299 pages (at 128 pages/min), scraped 0 items (at 0 items/min)
2016-01-07 11:40:19 [scrapy] INFO: Crawled 394 pages (at 95 pages/min), scraped 0 items (at 0 items/min)
2016-01-07 11:41:19 [scrapy] INFO: Crawled 487 pages (at 93 pages/min), scraped 0 items (at 0 items/min)
2016-01-07 11:42:19 [scrapy] INFO: Crawled 554 pages (at 67 pages/min), scraped 0 items (at 0 items/min)
2016-01-07 11:43:19 [scrapy] INFO: Crawled 616 pages (at 62 pages/min), scraped 0 items (at 0 items/min)
2016-01-07 11:44:19 [scrapy] INFO: Crawled 743 pages (at 127 pages/min), scraped 0 items (at 0 items/min)
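  1. What do the words "Crawled" and "scraped" mean?
  2. When Scrapy prints a log line such as "Crawled 743 pages (at 127 pages/min), scraped 0 items (at 0 items/min)", which function is being called at that point?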

1 Answer:

Answer 0 (score: -1)

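  1. A crawled page is a page that one of your spiders requested. Depending on how you programmed it, the spider should also parse that page. A scraped item is a set of data extracted during that parsing. Both are explained in the Scrapy tutorial: items, spiders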

  2. I'm not sure, but if I remember correctly, this is printed when the spider finishes its work.
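
The timestamps in the question are exactly one minute apart, which suggests the lines are reported periodically while the crawl is running. To the best of my knowledge they come from Scrapy's `LogStats` extension (`scrapy.extensions.logstats`), whose `log` method is called every `LOGSTATS_INTERVAL` seconds (60 by default). The sketch below is a simplified stand-in, not Scrapy's source: `StatsLogger` and its parameter names are hypothetical, and it only illustrates how the per-minute rates can be derived from the cumulative totals.

```python
# Simplified sketch of periodic crawl statistics; NOT Scrapy's actual source.
# StatsLogger and its argument names are hypothetical, chosen for illustration.


class StatsLogger:
    """Tracks cumulative counters and reports per-minute rates."""

    def __init__(self, interval_seconds=60.0):
        # Scales a per-interval delta to a per-minute rate.
        self.multiplier = 60.0 / interval_seconds
        self.pages_prev = 0
        self.items_prev = 0

    def log(self, pages, items):
        """Format one report from the cumulative totals seen so far."""
        page_rate = (pages - self.pages_prev) * self.multiplier
        item_rate = (items - self.items_prev) * self.multiplier
        # Remember the totals so the next call reports only the new delta.
        self.pages_prev, self.items_prev = pages, items
        return (
            "Crawled %d pages (at %d pages/min), "
            "scraped %d items (at %d items/min)"
            % (pages, page_rate, items, item_rate)
        )


# Cumulative page totals copied from the first four log lines in the question.
# The item total stays at 0, as it would if the callbacks yielded no items.
logger = StatsLogger(interval_seconds=60.0)
lines = [logger.log(pages, 0) for pages in (61, 171, 299, 394)]
for line in lines:
    print(line)
```

Each rate is just the difference between two consecutive totals, e.g. 171 - 61 = 110 pages/min. A "scraped" count that stays at 0 would mean pages are being downloaded but the spider's callbacks are not yielding any items.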