尝试在http://72.ru网站上进行身份验证,发现有重定向到https://loginka.ru/auth/。发现在数据表单中有302个带有普通凭证的POST。从Chrome复制标题可以在cURL中重现该标题,但仍无法在请求模块中访问。
警告:页面上满是俄文字母,在东北方框中注册
with requests.Session() as s:
s.auth = ('EMAIL', 'PASSWD')
s.post('http://72.ru/passport/login.php')
p = s.get('http://72.ru/job/favorite/vacancy/')
# will print True if logged
print('some title from favorite page, if logged' in p.text)
为什么无法进行身份验证,我做错了什么?
答案 0 :(得分:2)
我认为您需要指定allow_redirects=True
s.post('http://72.ru/passport/login.php', allow_redirects=True)
答案 1 :(得分:2)
有一种更简单的方式来登录这个网站。
import requests
headers = {
"User-Agent":
"Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.101 Safari/537.36",
}
s = requests.session()
s.headers.update(headers)
# There is a dedicated login page, which is the url of the Login button on the site, you can open that directly.
# Requests will automatically take care of rediects
s.get('https://loginka.ru/auth/?url=http%3A%2F%2F72.ru')
# Generate the post data
data = {
'url': 'http://72.ru',
'email': username,
'password': password
}
# Perform the post request
r = s.post('https://loginka.ru/auth/?url=http%3A%2F%2F72.ru', data=data)
# There is an extra post request on this site which uses token from redirect url
token = r.url[r.url.index('token=')+6:]
url = 'http://72.ru/put_token_to_user/?token=' + token + '&dummy_put_token_to_user=yes'
headers2 = {'X-Requested-With': 'XMLHttpRequest', 'Referer': r.url}
r = s.get(url, headers=headers2)
r = s.get('http://72.ru/passport/mypage.php')
print r.url
print r.status_code
with open('abc.txt', 'wb') as f:
f.write(r.content)
答案 2 :(得分:1)
from calendar import timegm
from time import gmtime
import requests
headers = {
"User-Agent":
"Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2272.101 Safari/537.36",
}
s = requests.session()
s.headers.update(headers)
epoch1 = '%s000' % timegm(gmtime())
s.get('http://72.ru/')
epoch2 = '%s000' % timegm(gmtime())
login_url = 'https://loginka.ru/service/api/passport/auth/token/?callback=jQuery17107978048569057137_%s&_=%s' % (epoch1, epoch2)
s.get(login_url)
epoch3 = '%s000' % timegm(gmtime())
params = {
'callback': 'jQuery17107978048569057137_%s' % epoch1,
'email': username, # Username Email
'password': password, # Password
'remember': 0,
'_': epoch3
}
r = self.s.get('https://loginka.ru/service/api/passport/auth/login/', params=params)
print r.content