使用Beautiful Soup抓取具有URL的弹出窗口(否则将出错)

时间:2019-01-04 16:07:26

标签: python beautifulsoup

我正在研究一个刮擦skyward.smsd.org的科学项目,它在弹出窗口中打开,但是在页面顶部,当我转到它时,它有一个URL,而不是在弹出窗口中,它表示您的会话已过期我无法找到解决方法。另外,我还有一个无效的语法错误:msg,如果有人可以帮助我找到这些问题的解决方案

while True:

    import requests
    from bs4 import BeautifulSoup
    import time
    from time import sleep
    url = "https://skyward.smsd.org/scripts/wsisa.dll/WService=wsEAplus/sfcalendar002.w"

    headers = {'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/39.0.2171.95 Safari/537.36'}

    response = requests.get(url, headers=headers)

    soup = BeautifulSoup(response.text, "lxml")
from requests.packages.urllib3 import add_stderr_logger

add_stderr_logger()
s = requests.Session()

s.headers['User-Agent'] = 'Mozilla/5.0'

login = {login: 3078774, password: (MY PASSWORD)}
login_response = s.post(url, data=login)
for r in login_response.history:
    if r.status_code == 401:  # 401 means authentication failed
        sys.exit(1)  # abort

pdf_response = s.get(pdf_url)  # Your cookies and headers are automatically included

if str(soup).find("skyward") == -1:
    continue

time.sleep(60)



else:
     msg = 'Subject: This is the script talking, check Skyward'

#Possibilty to make this tell you exactly what is changed
#A text feature that goes out daily for missing assignments
fromaddr = '3078774@smsd.org'

toaddrs  = ['3078774@smsd.org']



print('From: ' + fromaddr)
print('To: ' + str(toaddrs))
print('Message: ' + msg)

break

0 个答案:

没有答案