Question

我编写了一个脚本来解析来自网站的链接。该页面的左侧栏中有24个链接。我的脚本可以解析它们。但是，我想只获得前20个链接。我现在该怎么办？

以下是我现在尝试的脚本：

import requests
from lxml import html

response = requests.get("http://www.trmsites.com/shop/home.asp?siteid=90122").text
tree = html.fromstring(response)
for titles in tree.xpath("//td[@class='leftnav']/a/@href"):
    print(titles)

Answer 1

tree.xpath("//td[@class='leftnav']/a/@href")是一个列表。你可以切片。

tree.xpath("//td[@class='leftnav']/a/@href")[:20]

无法获得特定链接而不是所有链接

1 个答案: