python - How do I follow links in Scrapy if I don't subclass CrawlSpider?

Asked: 2012-12-11 21:48:31

Tags: python django scrapy

This is my code, and my spider is not following links. My spider subclasses InitSpider:

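# Legacy Scrapy 0.x import paths, matching the API used in this question
from scrapy.contrib.spiders.init import InitSpider
from scrapy.contrib.spiders import Rule
from scrapy.contrib.linkextractors.sgml import SgmlLinkExtractor
from scrapy.selector import HtmlXPathSelector

class TestSpider(InitSpider):
    name = 'login'
    allowed_domains = ['example.com']
    start_urls = ['http://www.example.com']
    rules = (
        Rule(SgmlLinkExtractor(allow=(r'example\.com', 'PostSearch'),
                               restrict_xpaths="//dd[@class='nextPage']"),
             callback='parse_items', follow=True),
    )

    def parse_items(self, response):
        hxs = HtmlXPathSelector(response)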

I am using this example: https://stackoverflow.com/a/5857202/1894766
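
For context, `rules` is processed by CrawlSpider, and InitSpider does not inherit from it, so one possible workaround is to extract the "next page" links by hand inside the callback and request them explicitly. The following is only a minimal sketch under assumptions, not a confirmed fix for this spider: the helper names `NextPageLinkParser` and `next_page_urls` are hypothetical, the link extraction uses the standard library instead of Scrapy's own selectors so that it runs without Scrapy installed, and it assumes the pagination links are `<a>` tags inside `<dd class="nextPage">` as the XPath in the question suggests.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Same patterns as the allow=(...) tuple in the question; a URL is kept
# if it matches at least one of them.
ALLOW = (re.compile(r"example\.com"), re.compile(r"PostSearch"))


class NextPageLinkParser(HTMLParser):
    """Collect href values of <a> tags nested inside <dd class="nextPage">."""

    def __init__(self):
        super().__init__()
        self.depth = 0   # > 0 while we are inside a matching <dd>
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "dd":
            # Enter on a matching <dd>; also count nested <dd> tags so the
            # closing tags pair up correctly.
            if self.depth or attrs.get("class") == "nextPage":
                self.depth += 1
        elif tag == "a" and self.depth and attrs.get("href"):
            self.hrefs.append(attrs["href"])

    def handle_endtag(self, tag):
        if tag == "dd" and self.depth:
            self.depth -= 1


def next_page_urls(html, base_url):
    """Return absolute pagination URLs found in html that match ALLOW."""
    parser = NextPageLinkParser()
    parser.feed(html)
    urls = [urljoin(base_url, href) for href in parser.hrefs]
    return [u for u in urls if any(p.search(u) for p in ALLOW)]


# Inside the spider, the callback would then schedule the requests itself.
# Shown as a comment so the sketch runs without Scrapy; Request is
# scrapy.http.Request, and response.text assumes a recent Scrapy version:
#
#     def parse(self, response):
#         for url in next_page_urls(response.text, response.url):
#             yield Request(url, callback=self.parse)
```

Note that responses for `start_urls` go to the spider's `parse` method by default, so a callback named `parse_items` is only called if something explicitly passes it to a request.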

0 answers:

No answers yet.