I am using selenium.webdriver to write a Python program that logs in to a website and tries to scrape some data. The program:
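In the program, group_members_id is a user ID, for example 10202964246625371, 10211651800563690 ..., so group_members_id_url is FACEBOOK_URL followed by that ID, for example "https://www.facebook.com/10202964246625371".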
import time

from selenium import webdriver
from selenium.webdriver.common.keys import Keys

FACEBOOK_URL = "https://www.facebook.com/"
...

# Defined before the __main__ block so the call at the bottom can find it
def get_group_user_friends_info(web_driver, group_id):
    ...
    for group_members_id in group_members_id_list:
        # Build the profile URL from the numeric user ID
        group_members_id_url = FACEBOOK_URL + str(group_members_id)
        web_driver.get(group_members_id_url)
        time.sleep(10)
        print(web_driver.current_url)

if __name__ == '__main__':
    # log in to the Facebook website
    login_url = "https://www.facebook.com/login.php?login_attempt=1&lwv=110"
    webdriver_browser = webdriver.Chrome()
    webdriver_browser.get(login_url)
    # user credentials
    user = webdriver_browser.find_element_by_css_selector("#email")
    user.send_keys('choikunchen@gmail.com')
    password = webdriver_browser.find_element_by_css_selector("#pass")
    password.send_keys('expectopatronum')
    login = webdriver_browser.find_element_by_css_selector("#loginbutton")
    login.click()
    time.sleep(5)
    # the login may fail and return to the login page
    if "login" in webdriver_browser.current_url:
        webdriver_browser.close()
        time.sleep(10)
        ...
    get_group_user_friends_info(webdriver_browser, group_id)
So group_members_id_url takes values such as
"https://www.facebook.com/10202964246625371",
"https://www.facebook.com/10211651800563690", ...
I expected that after executing "web_driver.get(group_members_id_url)" and "time.sleep(10)", "web_driver.current_url" would have changed to
"https://www.facebook.com/gabriela.carrazano" (corresponding to https://www.facebook.com/10202964246625371),
"https://www.facebook.com/yassmeen.ezzerqti" (corresponding to https://www.facebook.com/10211651800563690), ...
and these are what I need. However, when the program executes
print(web_driver.current_url)
it displays
"https://www.facebook.com/10202964246625371", not "https://www.facebook.com/gabriela.carrazano",
"https://www.facebook.com/10211651800563690", not "https://www.facebook.com/yassmeen.ezzerqti",
and so on, and the browser address bar shows the same numeric URLs. So the browser has not jumped to the username URLs when executing
web_driver.get(group_members_id_url)
Instead, Chrome displays this error:
Sorry, this content isn't available right now
The link you followed may have expired, or the page may only be visible to an audience you're not in.
However, when I open the Chrome browser myself and type https://www.facebook.com/10202964246625371, https://www.facebook.com/10211651800563690 ... into the address bar, the browser jumps to
"https://www.facebook.com/gabriela.carrazano",
"https://www.facebook.com/yassmeen.ezzerqti"
...
automatically and successfully. Could you tell me how to solve this problem?
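To narrow this down, it may help to check whether the redirect ever happens instead of relying on a fixed time.sleep(10). The following is only a minimal diagnostic sketch, not a fix: it polls current_url until it differs from the requested URL or a timeout expires. The names wait_for_url_change and FakeDriver are hypothetical and are not part of Selenium. FakeDriver only imitates an object with a current_url attribute so the example runs without a browser.

```python
import time


def wait_for_url_change(driver, requested_url, timeout=10.0, poll_interval=0.5):
    """Poll driver.current_url until it differs from requested_url.

    Returns the new URL, or None if it did not change before the timeout.
    """
    deadline = time.monotonic() + timeout
    while True:
        current = driver.current_url
        if current != requested_url:
            return current
        if time.monotonic() >= deadline:
            return None
        time.sleep(poll_interval)


class FakeDriver:
    """Hypothetical stand-in for a WebDriver (illustration only).

    It reports start_url for the first few reads, then final_url,
    imitating a page that redirects shortly after loading.
    """

    def __init__(self, start_url, final_url, reads_before_redirect):
        self._start_url = start_url
        self._final_url = final_url
        self._reads_left = reads_before_redirect

    @property
    def current_url(self):
        if self._reads_left > 0:
            self._reads_left -= 1
            return self._start_url
        return self._final_url


if __name__ == "__main__":
    requested = "https://www.facebook.com/10202964246625371"
    fake = FakeDriver(requested, "https://www.facebook.com/gabriela.carrazano", 3)
    # Prints the redirected URL, or None if no redirect was observed in time
    print(wait_for_url_change(fake, requested, timeout=2.0, poll_interval=0.01))
```

Because the function only reads current_url, the web_driver object from the program above could presumably be passed in the same way, right after web_driver.get(group_members_id_url). A None result would suggest that no redirect happens at all in the automated session, which would match the "content isn't available" page. Selenium also provides WebDriverWait together with expected_conditions.url_changes for this kind of wait.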