带重定向的python请求

时间:2015-06-14 23:00:09

标签: python authentication redirect curl python-requests

尝试在http://72.ru网站上进行身份验证,发现有重定向到https://loginka.ru/auth/。发现在数据表单中有302个带有普通凭证的POST。从Chrome复制标题可以在cURL中重现该标题,但仍无法在请求模块中访问。

警告:页面上满是俄文字母,在东北方框中注册

with requests.Session() as s:
    s.auth = ('EMAIL', 'PASSWD')

    s.post('http://72.ru/passport/login.php')
    p = s.get('http://72.ru/job/favorite/vacancy/')

    # will print True if logged
    print('some title from favorite page, if logged' in p.text)

为什么无法进行身份验证,我做错了什么?

3 个答案:

答案 0 :(得分:2)

我认为您需要指定allow_redirects=True

s.post('http://72.ru/passport/login.php', allow_redirects=True)

答案 1 :(得分:2)

有一种更简单的方式来登录这个网站。

import requests

headers = {
    "User-Agent":
        "Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.101 Safari/537.36",
}

s = requests.session()
s.headers.update(headers)

# There is a dedicated login page, which is the url of the Login button on the site, you can open that directly. 
# Requests will automatically take care of rediects
s.get('https://loginka.ru/auth/?url=http%3A%2F%2F72.ru')

# Generate the post data
data = {
    'url': 'http://72.ru',
    'email': username,
    'password': password
}

# Perform the post request
r = s.post('https://loginka.ru/auth/?url=http%3A%2F%2F72.ru', data=data)

# There is an extra post request on this site which uses token from redirect url
token = r.url[r.url.index('token=')+6:]
url = 'http://72.ru/put_token_to_user/?token=' + token + '&dummy_put_token_to_user=yes'
headers2 = {'X-Requested-With': 'XMLHttpRequest', 'Referer': r.url}
r = s.get(url, headers=headers2)

r = s.get('http://72.ru/passport/mypage.php')
print r.url
print r.status_code
with open('abc.txt', 'wb') as f:
    f.write(r.content)

答案 2 :(得分:1)

from calendar import timegm
from time import gmtime

import requests

headers = {
    "User-Agent":
        "Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.101 Safari/537.36",
}

s = requests.session()
s.headers.update(headers)
epoch1 = '%s000' % timegm(gmtime())
s.get('http://72.ru/')
epoch2 = '%s000' % timegm(gmtime())
login_url = 'https://loginka.ru/service/api/passport/auth/token/?callback=jQuery17107978048569057137_%s&_=%s' % (epoch1, epoch2)
s.get(login_url)
epoch3 = '%s000' % timegm(gmtime())
params = {
    'callback': 'jQuery17107978048569057137_%s' % epoch1,
    'email': username,     # Username Email
    'password': password,     # Password
    'remember': 0,
    '_': epoch3
}
r = self.s.get('https://loginka.ru/service/api/passport/auth/login/', params=params)
print r.content