Selenium只点击一次链接,再次调用click()会返回错误

时间:2014-03-24 08:39:08

标签: python selenium screen-scraping

我是Selenium的新手,并且设法编写了这些代码。我想通过点击'>'来废弃表格中的数据。链接在右下角。第一次点击有效,但接下来的两次没有。我错过了什么?感谢。

# coding: utf-8
from selenium import webdriver
from selenium.webdriver.common.by import By

browser = webdriver.Firefox()
browser.get('http://s.cafef.vn/Lich-su-giao-dich-HSG-1.chn')

next_page_link = browser.find_element_by_partial_link_text('>')
next_page_link.click()
next_page_link = browser.find_element_by_partial_link_text('>')
next_page_link.click()
next_page_link = browser.find_element_by_partial_link_text('>')
next_page_link.click()

这是例外

Traceback (most recent call last):
  File "cafef.py", line 13, in <module>
    next_page_link.click()
  File "/usr/local/lib/python2.7/dist-packages/selenium-2.40.0-py2.7.egg/selenium/webdriver/remote/webelement.py", line 59, in click
    self._execute(Command.CLICK_ELEMENT)
  File "/usr/local/lib/python2.7/dist-packages/selenium-2.40.0-py2.7.egg/selenium/webdriver/remote/webelement.py", line 369, in _execute
    return self._parent.execute(command, params)
  File "/usr/local/lib/python2.7/dist-packages/selenium-2.40.0-py2.7.egg/selenium/webdriver/remote/webdriver.py", line 164, in execute
    self.error_handler.check_response(response)
  File "/usr/local/lib/python2.7/dist-packages/selenium-2.40.0-py2.7.egg/selenium/webdriver/remote/errorhandler.py", line 164, in check_response
    raise exception_class(message, screen, stacktrace)
selenium.common.exceptions.StaleElementReferenceException: Message: u'Element is no longer attached to the DOM' ; Stacktrace: 
    at fxdriver.cache.getElementAt (resource://fxdriver/modules/web_element_cache.js:7613)
    at Utils.getElementAt (file:///tmp/tmpIDOSdW/extensions/fxdriver@googlecode.com/components/command_processor.js:7210)
    at fxdriver.preconditions.visible (file:///tmp/tmpIDOSdW/extensions/fxdriver@googlecode.com/components/command_processor.js:8223)
    at DelayedCommand.prototype.checkPreconditions_ (file:///tmp/tmpIDOSdW/extensions/fxdriver@googlecode.com/components/command_processor.js:10861)
    at DelayedCommand.prototype.executeInternal_/h (file:///tmp/tmpIDOSdW/extensions/fxdriver@googlecode.com/components/command_processor.js:10878)
    at DelayedCommand.prototype.executeInternal_ (file:///tmp/tmpIDOSdW/extensions/fxdriver@googlecode.com/components/command_processor.js:10883)
    at DelayedCommand.prototype.execute/< (file:///tmp/tmpIDOSdW/extensions/fxdriver@googlecode.com/components/command_processor.js:10825) 

2 个答案:

答案 0 :(得分:4)

第二次next_page_link.click()调用在浏览器加载下一页之前发生。 使用wait.until添加EC.element_to_be_clickable

from selenium import webdriver
import selenium.webdriver.support.ui as UI
from selenium.webdriver.common.by import By
import selenium.webdriver.support.expected_conditions as EC 
import contextlib

with contextlib.closing(webdriver.Firefox()) as browser:
    browser.get('http://s.cafef.vn/Lich-su-giao-dich-HSG-1.chn')
    wait = UI.WebDriverWait(browser, 10)
    for i in range(3):            
        next_page_link = wait.until(
            EC.element_to_be_clickable((By.PARTIAL_LINK_TEXT, '>')))
        next_page_link.click()

答案 1 :(得分:1)

通常来自unutbu的anwser就足够了。

我使用.net而不是python的selenium所以我不能给出python的详细代码但是为了稳定你应该等待3次:

  1. 等待下一页或ajax内容加载,在这种情况下你应该等到 .CafeF_Paging td span strong的文字是您排除的页码。
  2. 等待jquery加载,直到jQuery.active == 0
    查看http://sullerton.com/2013/08/selenium-webdriver-wait-for-ajax-with-python/
  3. 等待元素可见,不仅定位,使用visibility_of_element_located。