Question

我正在尝试创建一个Python脚本，在给定username和password的情况下，脚本会登录到Google，然后在登录时获取页面的HTML数据（例如Google主页）

我的代码不起作用，我不知道为什么。

import sys
reload(sys)
sys.setdefaultencoding('utf-8')    # need to do this to be able to write html data to a file

from requests import session

payload = {
    'Email': EMAIL,
    'Passwd': PASSWORD
}

headers = {'User-Agent': 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/39.0.2171.95 Safari/537.36'}    # to make Google think that this is a real browser not a python script

with session() as c:

    res1 = c.post('https://accounts.google.com/signin/challenge/sl/password', data=payload, headers=headers)

    res2 = c.get('http://www.google.com/', headers=headers)
    html = res2.text

    f = open("test.html", "w")
    f.write(html)
    f.close()

当我进入我撰写的html文件时，它并不表示我已登录，因为有一个＆＃34;登录＆＃34;按钮。

我打印出了cookies，并且有一个＆＃34; NID＆＃34;如果我在User-Agent get时没有使用http://google.com/标题，那么我会使用User-Agent标题，而{}}} Cookie会有GAPS个Cookie。< / p>

我从登录页面的html格式中获得了Email和Passwd。

有谁知道如何解决这个问题？

如何使用python-requests登录google来检索登录页面的html？

0 个答案: