Submitting a POST with Python and scraping the result with bs4

Time: 2019-10-28 04:09:12

Tags: python-3.x web-scraping beautifulsoup python-requests facebook-app-requests

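I'm trying to build a tool that scrapes the result page of Facebook's password-reset link and has the script print the email. My problem is that the payload isn't being submitted, and I'm not sure what I'm doing wrong. Any help would be appreciated. I'm new to scraping and am trying to keep the script to as few lines as possible until I get better.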

In short, I'm trying to submit a POST and scrape the email from the next page...

from bs4 import BeautifulSoup
import requests

url = "https://www.facebook.com/login/identify/"
target_profile = input("Enter the Target's Profile Link: ")  # raw_input() does not exist in Python 3
payload = {"email": target_profile, "submit": "submit"}
r = requests.post(url, data=payload)

html_soup = BeautifulSoup(r.content, 'html.parser')
type(html_soup)
#info_container = html_soup.find_all('div', class_= 'uiInputLabel clearfix uiInputLabelLegacy')
#print(type(info_container))
#print(len(info_container))

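email_scraper = html_soup.find('div', class_='_8u _42ef')

# find() returns None when no element matches, so check before printing
if email_scraper is None:
    print("No matching element found")
else:
    print(email_scraper.prettify())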
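
One thing worth checking in the lookup above: when `class_` is given a string containing several class names, BeautifulSoup matches it against the exact value of the `class` attribute, so the order of the names matters. The sketch below uses a made-up HTML snippet with hypothetical class names `a` and `b` (not Facebook's markup) to show the difference between that and a CSS selector, which ignores order.

```python
from bs4 import BeautifulSoup

# Made-up markup for illustration; the class names "a" and "b" are hypothetical.
html = '<div class="b a">user@example.com</div>'
soup = BeautifulSoup(html, "html.parser")

# A multi-class string is compared with the exact attribute value,
# so "a b" does not match class="b a".
exact = soup.find("div", class_="a b")

# A CSS selector matches elements carrying both classes, in any order.
by_selector = soup.select_one("div.a.b")

print(exact)
print(by_selector.get_text(strip=True))
```

If the real page lists its classes in a different order, or adds further classes, the `find` call returns `None` even though the element is present.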

1 Answer:

Answer 0: (score: 0)

You forgot to serialize the payload with json.dumps:

import json

payload = json.dumps({"email": target_profile, "submit": "submit"})
r = requests.post(url, data=payload)
...
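
To see what actually changes between passing a dict and passing a `json.dumps` string as `data`, the request can be prepared without being sent. This is a minimal sketch: the URL and email value are placeholders, and `describe` is a hypothetical helper, not part of requests.

```python
import json
import requests

# Placeholder endpoint and value, for illustration only; nothing is sent.
URL = "https://example.com/form"
payload = {"email": "someone@example.com", "submit": "submit"}

def describe(**kwargs):
    """Build a POST request without sending it; return (Content-Type, body)."""
    prepared = requests.Request("POST", URL, **kwargs).prepare()
    return prepared.headers.get("Content-Type"), prepared.body

# A dict passed as data= is form-encoded.
form_type, form_body = describe(data=payload)

# A string passed as data= is sent as-is, with no Content-Type set by requests.
raw_type, raw_body = describe(data=json.dumps(payload))

# json= serializes the dict and sets a JSON Content-Type.
json_type, json_body = describe(json=payload)

print(form_type, form_body)
print(raw_type, raw_body)
print(json_type, json_body)
```

Which variant the server accepts depends on the endpoint. A plain HTML form is normally submitted form-encoded, so it is worth comparing these against what the browser sends before settling on one.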