python获取用户网站的URL

时间:2017-05-09 01:46:30

标签: python

使用selenium.driver写一个python程序登录一个网站并尝试抓取一些数据,程序:

About.ShowDialog();

在程序中,group_members_id是用户ID,例如10202964246625371,10211651800563690 ...,因此,group_members_id_url是

from selenium import webdriver
from selenium.webdriver.common.keys import Keys
FACEBOOK_URL = "https://www.facebook.com/" 
...

if __name__ == '__main__':
    #log in Facebook website
    login_url = "https://www.facebook.com/login.php?login_attempt=1&lwv=110"
    webdriver_browser = webdriver.Chrome()
    webdriver_browser.get(login_url)

    # user credentials
    user = webdriver_browser.find_element_by_css_selector("#email")
    user.send_keys('choikunchen@gmail.com')
    password = webdriver_browser.find_element_by_css_selector("#pass")
    password.send_keys('expectopatronum')
    login = webdriver_browser.find_element_by_css_selector("#loginbutton")
    login.click()
    time.sleep(5)
    # the login maybe fail, return to the login page
    if "login" in webdriver_browser.current_url:
        webdriver_browser.close()
    time.sleep(10)
     ...
    get_group_user_friends_info(webdriver_browser,group_id)

def get_group_user_friends_info(web_driver,group_id):
    ...
    for group_members_id in group_members_id_list:
        group_members_id_url = FACEBOOK_URL + str(group_members_id)
        web_driver.get(group_members_id_url)
        time.sleep(10)
        print(web_driver.current_url)

错误,chrome显示:

"https://www.facebook.com/10202964246625371",
"https://www.facebook.com/10211651800563690"..., I consider, when executing "web_driver.get(group_members_id_url)" and "time.sleep(10)", the "web_driver.current_url" has changed to 

"https://www.facebook.com/gabriela.carrazano(correspongding to https://www.facebook.com/10202964246625371)",
"https://www.facebook.com/yassmeen.ezzerqti(corrsponding to https://www.facebook.com/10211651800563690)"...,and these are what I need, however,when the program execute:

web_driver.get(group_members_id_url)

并且在浏览器地址栏中,它是

Sorry, this content isn't available right now
The link you followed may have expired, or the page may only be visible to    an audience you're not in.

等等......因此,它还没有跳到

"https://www.facebook.com/10202964246625371", not "https://www.facebook.com/gabriela.carrazano", and )",
"https://www.facebook.com/10211651800563690", not https://www.facebook.com/yassmeen.ezzerqti", 

"https://www.facebook.com/gabriela.carrazano", 
"https://www.facebook.com/yassmeen.ezzerqti"
...

显示

print(web_driver.current_url)

然而,当我打开chrome浏览器并输入时     https://www.facebook.com/10202964246625371     https://www.facebook.com/10211651800563690     ...

在地址栏中,浏览器跳转到

"https://www.facebook.com/10202964246625371", not "https://www.facebook.com/gabriela.carrazano", 
"https://www.facebook.com/10211651800563690", not https://www.facebook.com/yassmeen.ezzerqti"

自动成功 你能告诉我如何解决这个问题

0 个答案:

没有答案