Unable to simulate onclick JavaScript inside a <span> tag using Selenium WebDriver and Python

Time: 2014-09-24 13:02:40

Tags: javascript python selenium web-scraping beautifulsoup

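I am trying to simulate the onclick event on the following tag:

<span class="taLnk hvrIE6 tr165579546 moreLink ulBlueLinks" onclick="ta.util.cookie.setPIDCookie(2247); ta.call('ta.servlet.Reviews.expandReviews', event, this, 'review_165579546', '1', 2247)">More</span>

Clicking it reveals the rest of the text under the link. I am using Selenium WebDriver with Python to trigger this event automatically on this page: http://www.tripadvisor.in/Hotel_Review-g297586-d1154547-Reviews-Rainbow_International_Hotel-Hyderabad_Telangana.html

Could anyone share a code snippet that fires this JavaScript event so that, once the page has loaded, I can automatically see the full text under the link? I tried Selenium WebDriver's click() method, but it did not work.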

2 Answers:

Answer 0: (score: 0)

This worked for me:

from selenium import webdriver
from selenium.webdriver.common.by import By

url = 'http://www.tripadvisor.in/Hotel_Review-g297586-d1154547-Reviews-Rainbow_International_Hotel-Hyderabad_Telangana.html'

browser = webdriver.Firefox()
browser.get(url)
# find_element(By.CSS_SELECTOR, ...) replaces find_element_by_css_selector, which was removed in Selenium 4
li = browser.find_element(By.CSS_SELECTOR, '#PERSISTENT_TAB_HR .tabs_pers_content li:nth-child(2)')
li.click()

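Answer 1: (score: 0)

Here is an idea you can start from:

  • Iterate over all reviews on the page (div elements whose id starts with review_)
  • For each review, click the More link if there is one (the span with the moreLink class name)
  • After a short delay, get the full review text

Here is the implementation: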

import time
from selenium import webdriver
from selenium.common.exceptions import NoSuchElementException, TimeoutException
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

driver = webdriver.Firefox()
driver.get("http://www.tripadvisor.com/Hotel_Review-g297586-d1154547-Reviews-Rainbow_International_Hotel-Hyderabad_Telangana.html")

# Each review lives in a div whose id starts with "review_"
for review in driver.find_elements(By.XPATH, '//div[starts-with(@id, "review_")]'):
    try:
        # Wait up to 3 seconds for a "More" link inside this review
        more = WebDriverWait(review, 3).until(EC.presence_of_element_located((By.CLASS_NAME, 'moreLink')))
        if more.is_displayed():
            more.click()
            time.sleep(1)  # give the expanded text time to load
    except (NoSuchElementException, TimeoutException):
        pass  # no "More" link found for this review

    full_review = review.find_element(By.CLASS_NAME, 'dyn_full_review')
    print(full_review.text)
    print("----")

It prints the following (the output contains all the text in each review, including the username and date):

Mustufa W
1 review
“Horrible”
Reviewed August 15, 2014
I checked on price was high but cracked a deal
Poor hygiene in corridor & so in rooms. Washroom pipes were leaking. AC water dripping in washroom.
First I was given a room to which my surprise found window pane was missing after complaining room got changed.
They are cheating ppl only good thing abt hotel is the spot & is damn opposite Nilofer cafe which serves delicious tea,coffee & bakery products.
There is a guy named khwaja who was very helpful. Front @ reception guy was stupid..in one midnight , power went off & to my surprise they don't have power back up..
Stayed August 2014, traveled as a couple
Less
Was this review helpful?
Yes
Ask Mustufa W about Rainbow International Hotel
This review is the subjective opinion of a TripAdvisor member and not of TripAdvisor LLC.
----
mrravi4u
Bangalore, India
2 reviews
13 helpful votes
“Good Hotel”
Reviewed April 23, 2014
I stayed there 2 days i got good services. Rainbow Is good hotel in Hyderabad. there are very homely environment and hosting services was supper. it is also in center of hyderabad city so for convenience is is better place to stay.
Room Tip: Office Meeting
See more room tips
Stayed March 2014, traveled with friends
Value
Location
Sleep Quality
Rooms
Cleanliness
Service
Was this review helpful?
Yes
13
Ask mrravi4u about Rainbow International Hotel
This review is the subjective opinion of a TripAdvisor member and not of TripAdvisor LLC.
----
...
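
If the native click() still has no effect, as the question reports, one possible fallback is to dispatch the click from JavaScript through execute_script. The following is only a minimal sketch under assumptions: the helper name expand_all_reviews and its pause parameter are hypothetical, the page is assumed to still mark the links with the moreLink class, and "class name" is the locator string that By.CLASS_NAME stands for.

```python
import time


def expand_all_reviews(driver, pause=1.0):
    """Click every visible 'moreLink' element via JavaScript.

    Hypothetical helper: `driver` is expected to be a Selenium WebDriver.
    Returns the number of links that were clicked.
    """
    clicked = 0
    # "class name" is the string value behind By.CLASS_NAME
    for link in driver.find_elements("class name", "moreLink"):
        if link.is_displayed():
            # Calling click() on the DOM node runs the element's onclick handler
            driver.execute_script("arguments[0].click();", link)
            clicked += 1
            time.sleep(pause)  # crude wait for the expanded text to load
    return clicked


# Example usage (needs Firefox and geckodriver installed):
#   from selenium import webdriver
#   driver = webdriver.Firefox()
#   driver.get("http://www.tripadvisor.com/Hotel_Review-g297586-d1154547-Reviews-Rainbow_International_Hotel-Hyderabad_Telangana.html")
#   expand_all_reviews(driver)
```

If the page re-renders after a click, elements collected earlier may become stale, in which case the links would need to be looked up again after each click.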

Hope that makes things clearer.