尝试通过请求登录网站

时间:2019-12-19 23:19:31

标签: python python-requests

这是网站:https://nacionalidade.justica.gov.pt/

您会看到有一个输入类。 如何将带有参数的请求发送到网站,该请求将被发送到输入类?

输入密码后,页面会发生变化,如何获取新页面的内容?

this is the class I need to send information to

这是我需要向其发送密码的课程。

import requests
from lxml import html

payload = {'password' : 'mypassword... not gonna write it here',
       'ABE4A1723D5F2906F222936AD0E9BE0E' : 
       'ABE4A1723D5F2906F222936AD0E9BE0E'}
url = "https://nacionalidade.justica.gov.pt/"

session_requests = requests.session()
result = session_requests.get(url)

tree = html.fromstring(result.text)
authenticity_token = list(set(tree.xpath("//input[@name='ABE4A1723D5F2906F222936AD0E9BE0E']/@value")))[0]

result = session_requests.post(
    url, 
    data = payload, 
    headers = dict(referer=url)
)

r =requests.get(url,headers= dict(referer = url))
tree = html.fromstring(result.content)
bucket_names = tree.xpath("//div[@class='repo-list--repo']/a/text()")

print(bucket_names)

我遇到索引错误,我也不知道为什么

Traceback (most recent call last):
File "c:/Users/?/Desktop/?/?/new.py", line 12, in <module>

authenticity_token = list(set(tree.xpath("//input[@name='ABE4A1723D5F2906F222936AD0E9BE0E']/@value")))[0]
IndexError: list index out of range

2 个答案:

答案 0 :(得分:1)

这里有一个“如何正确操作”的示例,但是没有您的密码等。因此,请随时采用它作为解决方案。

from lxml.html import fromstring
import requests

session = requests.Session()

html = session.get('https://nacionalidade.justica.gov.pt/').content
form = fromstring(html).xpath('//form')[0]

post = {}
for input in form.inputs:
    if not input.name:
        continue

    post[input.name] = input.value

print('Initial form data')
print(post)

response = session.post('https://nacionalidade.justica.gov.pt/Home/GetEstadoProcessoAjax', data=post)
print(response.text)

输出

Initial form data
{'__RequestVerificationToken': 'p3AmUxqx-6Ipv9EbujoUid5TAKTm76oVmYIaB2UZMv_QzcH7LXaYhGwcQcqXqjWlJjJyvcKgcX48brUkWNvrmz-q3MPlg8mZAm56EGUooYw1', 'SenhaAcesso': '', 'DD94BAEBEF20FBD64FF8CA12170D623D': None, 'Email': None}
<link href="/layout/assets/css/grafico_nacionalidade.css" rel="stylesheet" />
<style>
    .titulo {
        color: #3399cc;
    }

    /*Bloco para o estado do processo*/
    #block_container {
        font-size: 1.5rem;
        font-weight: bold;
    }
    #bloc1
    {
        display:inline;
    }
    #bloc2
    {
        display:inline;
        font-size:1rem;
        font-weight:normal;
    }

    /*-------------------------------------*/

</style>
<div>


        <div id="txtErro" style="background-color:#FBEDED; width:100%; ">
            <div style="padding:35px;">
                A senha <b></b> n&#227;o corresponde a nenhum processo de nacionalidade ativo.
                <p></p>
                <br />
                Verifique que digitou a sua senha corretamente.
                <p></p>
                <br />
                Caso n&#227;o encontre o seu processo dirija-se &#224; <a href="http://www.irn.mj.pt/IRN/sections/irn/contactos">conservat&#243;ria</a> onde entregou o seu pedido.

                Na <a href="http://www.irn.mj.pt/IRN/sections/irn/contactos">Conservatória dos Registos Centrais</a>, em Lisboa, sem marcação prévia é possível obter a sua senha de acesso.
            </div>
        </div>


</div>
<hr />
<p>
    <a href="/Home/PesquisaProcesso">Voltar</a>
</p>
  

重要:

     
      
  1. 您必须对所有请求使用一个请求会话对象,因为存在启用了CSRF令牌功能的cookie会话。

  2.   
  3. 您必须发送所有初始表格数据以及您的密码或安全号码字段

  4.   
  5. 也许是您必须发送一些额外的HTTP标头才能获得响应,因为它是AJAX请求(我相信)。

  6.   

答案 1 :(得分:0)

似乎没有使用该名称的输入。看看是否可以使用更稳定的选择器。