How to web-scrape the next page

Time: 2020-10-17 18:24:22

Tags: python selenium web-scraping

I am trying to scrape data from this website: https://publishednotices.asic.gov.au/browsesearch-notices/

The problem is that there is no "next page" link, and the site is dynamic. Can anyone help me?

2 answers:

Answer 0: (score: 1)

Try this:

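from selenium import webdriver
from selenium.common.exceptions import NoSuchElementException
from selenium.webdriver.common.by import By
import time


browser = webdriver.Firefox()

# open url
browser.get("https://publishednotices.asic.gov.au/browsesearch-notices/")

# first group of page links: the links in the 3rd to 6th table cells
# find_element(By.CSS_SELECTOR, ...) replaces find_element_by_css_selector,
# which newer Selenium releases no longer provide
for page in range(3, 7):
    browser.find_element(By.CSS_SELECTOR, f"td:nth-child({page}) a").click()

    # get data

    time.sleep(1)

# later groups: keep clicking the links in cells 4 to 8 until one is missing
tmp = True
while tmp:
    for page in range(4, 9):
        try:
            browser.find_element(By.CSS_SELECTOR, f"td:nth-child({page}) a").click()
            time.sleep(1)
        except NoSuchElementException:
            # no such page link, so stop instead of trying the remaining cells
            tmp = False
            break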
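
The fixed time.sleep(1) in the code above can be too short if the dynamic content takes longer to reload, and wasteful if it reloads quickly. As a rough sketch only (not part of the original answer), a small polling helper in plain Python can wait until something on the page actually changes. The name wait_for_change, its parameters, and the "table" selector in the usage comment are assumptions made for illustration; Selenium also ships its own WebDriverWait class for explicit waits.

```python
import time


def wait_for_change(read_state, old_state, timeout=10.0, poll=0.2):
    """Poll read_state() until its result differs from old_state, then return it.

    Raises TimeoutError if nothing has changed once `timeout` seconds have passed.
    """
    deadline = time.monotonic() + timeout
    while True:
        state = read_state()
        if state != old_state:
            return state
        if time.monotonic() >= deadline:
            raise TimeoutError(f"content did not change within {timeout} s")
        time.sleep(poll)


# Usage sketch with the `browser` from the answer (the "table" selector is a guess):
#   read = lambda: browser.find_element(By.CSS_SELECTOR, "table").text
#   before = read()
#   browser.find_element(By.CSS_SELECTOR, "td:nth-child(3) a").click()
#   wait_for_change(read, before)
```

Comparing the text before and after the click only works if consecutive pages really show different content, so treat this as a starting point rather than a guaranteed fix.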
            

Answer 1: (score: 0)

You can get the element for the next page number and then click it with element.click().
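
One way this idea might look in code is sketched below. It assumes the page links are anchors inside table cells (the "td a" selector, borrowed from the other answer) and that each link's visible text is its page number; neither has been checked against the site. The helper names find_page_link and visit_pages are made up for this example. The driver argument is any Selenium WebDriver, for example webdriver.Firefox().

```python
import time

CSS = "css selector"  # the string value of selenium's By.CSS_SELECTOR


def find_page_link(driver, page_number, selector="td a"):
    """Return the first link whose visible text equals page_number, else None."""
    wanted = str(page_number)
    for link in driver.find_elements(CSS, selector):
        if link.text.strip() == wanted:
            return link
    return None


def visit_pages(driver, on_page, max_pages=100, pause=1.0):
    """Call on_page(driver, n) for pages 1, 2, ... and return the pages visited.

    Stops when no link for the next page number is found or max_pages is reached.
    """
    visited = []
    for page in range(1, max_pages + 1):
        on_page(driver, page)  # scrape whatever page is currently displayed
        visited.append(page)
        link = find_page_link(driver, page + 1) if page < max_pages else None
        if link is None:
            break
        link.click()
        time.sleep(pause)  # crude pause while the dynamic content reloads
    return visited
```

If the pager only shows a few numbers at a time and hides the rest behind a "..." link, the loop stops at the end of the visible group; clicking that link would need an extra step that is not shown here.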