我的例子:
from bs4 import BeautifulSoup as bs
import requests
from urllib.request import urlopen
base_url = 'https://book24.kz/catalog/klassicheskaya_proza/'
def BB_parse(base_url, headers):
soup = bs(request.content, 'lxml')
for pagination in soup.find_all('div', attrs={'class': 'pagination'}):
for num in pagination.find_all('a'):
rrr = num.getText().replace('Ctrl →','').replace("\n","")
print(rrr)
输出:
2
3
37
38
那么,如何仅输出最后一行? (如38)
答案 0 :(得分:0)
import requests
from bs4 import BeautifulSoup
r = requests.get("https://book24.kz/catalog/klassicheskaya_proza/")
soup = BeautifulSoup(r.text, 'html.parser')
for item in soup.findAll("div", {'class': 'pagination'}):
result = [tag.text for tag in item.findAll("a")]
print(result[-2])