Question

我需要阅读的网页包含具有索引号

的div类名

 <div class="abber" id="abber1">
   <div class="Listing_All Listing_Level5_CONTAINER">
     <div class="Listing_Level5_HEADER">
        .....
     </div>
   </div>
   <div class="Listing_All Listing_Level4_CONTAINER">
     <div class="Listing_Level4_HEADER">
        .....
     </div>
   </div>
   <div class="Listing_All Listing_Level3_CONTAINER">
     <div class="Listing_Level3_HEADER">
        .....
     </div>
   </div>
 </div>

我如何为此编写选择器？截至目前我有这个

response.xpath('//div[contains(@class,"Listing_Level")]').extract()

但这会获取与“Listing_Level”匹配的所有内容。怎么能添加像“Listing_Level * _Container”这样的东西？

Answer 1

我不知道我是否理解你想要的东西，但你可以试试这个：

response.xpath('//div[contains(@class,"Listing_Level{}_Container")]'.format(n)).extract()

n是您想要的数字，例如，如果您想选择＆＃34; Listing_Level5_Container＆＃34;然后使用：

response.xpath('//div[contains(@class,"Listing_Level{}_Container")]'.format(5)).extract()

如何读取Scrapy Python中附加数字的div类名称

1 个答案: