Question

案例：我试图从网站中提取页面数据。我在页面中创建了一个过滤器，如下所示代码：

 fp = webdriver.FirefoxProfile()
 fp.set_preference("javascript.enabled", True)
 b = webdriver.Firefox(firefox_profile=fp)
 b.get(url)
 time.sleep(10)
 search = b.find_element_by_name("rb")
 search.clear()
 search.send_keys('dove')
 search.send_keys(Keys.ESCAPE)
 search.submit()
 shampoo_sel = b.find_element_by_id('flt-46')
 shampoo_sel.click()
 conditioner_sel = b.find_element_by_id('flt-47')
 conditioner_sel.click()
 time.sleep(5)
 search_url = b.current_url
 dp = urllib2.urlopen(search_url).read()
 dp_soup = BeautifulSoup(dp)
 search_page_num = dp_soup.find("li", { "id" : "pagContinue" })
 print search_page_num

虽然我尝试使用当前URL保存代码（过滤器之前和之后的URL都相同，因此无法获得过滤后的确切页数）在这种情况下我该怎么做???

处理Ajax请求Python

0 个答案: