Question

我是Scrapy和Python的新手。我正在尝试使用forloop从网站（“https://in.bookmyshow.com/bengaluru/movies”）中提取数据，但它似乎没有工作

def parse(self, response):
      for sel in response.xpath('//div[@class="mv-row"]'):
          item = ExampleItem()
          item['Moviename'] = sel.xpath('.//a[@class="__movie-name"]//text()').extract()
          item['Language'] = sel.xpath('.//li[@class="__language"]//text()').extract()
          item['Info'] = sel.xpath('.//div[@class="__rounded-box __genre"]/text()').extract()
          yield item

Answer 1

定位器本身是正确的，您只需要修复正在循环的容器定位器：

替换：

for sel in response.xpath('//div[@class="mv-row"]'):

使用：

for sel in response.xpath('//div[contains(@class, "movie-card")]'):

无法使用scrapy在forloop中提取数据

1 个答案: