我正在尝试使用requests.get
捕获https响应,但我没有获得与在Web浏览器中看到的相同的信息。实际网页将包含用户名,密码字段和提交按钮。网页是用java脚本编写的。
所以,我想知道是否可以使用请求模块捕获https响应?任何帮助都将受到高度赞赏
>>> cafile='/etc/pki/tls/mycertfile.pem'
>>> import requests
>>> r=requests.get('https://10.22.156.151/performance',verify=cafile)
/opt/rh/python27/root/usr/lib/python2.7/site-packages/urllib3/connection.py:344: SubjectAltNameWarning: Certificate for 10.22.156.151 has no `subjectAltName`, falling back to check for a `commonName` for now. This feature is being removed by major browsers and deprecated by RFC 2818. (See https://github.com/shazow/urllib3/issues/497 for details.)
SubjectAltNameWarning
>>> r
<Response [200]>
>>> r.text
u'<!doctype html>\n<html>\n<!--\n# Copyright (c) 2001-2018 Hewlett Packard Enterprise Development LP\n-->\n<head>\n <meta charset="utf-8"/>\n <meta http-equiv="X-UA-Compatible" content="IE=edge"/>\n\n <script src="/js/mocha_history_compat.js"></script>\n\n <link rel="shortcut icon" href="/noauth/theme/airwave/favicon.ico" type="image/x-icon" />\n\n\n <link rel="stylesheet" href="/mercury.1516735736.css" />\n\n\n <link rel="stylesheet" href="/helpdesk/style/style.1516735737.css"/>\n <!--[if !IE]><!-->\n <link rel="stylesheet" href="/helpdesk/style/not_ie.1516735737.css"/>\n <!--<![endif]-->\n\n <link rel="stylesheet" href="/frappe/style/style.1516735769.css"/>\n\n\n <script src="/mercury.1516735736.js"></script>\n\n\n\n <script src="/frappe/script/module_frappe.1516735769.js"></script>\n\n\n\n <script src="/helpdesk/script/third_party/highcharts/highcharts.1516735737.js"></script>\n <script src="/helpdesk/script/third_party/highcharts/highcharts-more.1516735737.js"></script>\n <script src=""></script>\n\n\n\n <script type="text/javascript">\n window.mocha_app_args = {\n "dev_mode" : null,\n "accept_language_locale" : null,\n "vendor_name" : "Aruba",\n "mocha_en_mtime" : 1516735737,\n "short_copyright_info" : "© 2018 Hewlett Packard Enterprise Development LP",\n "supported_languages" : {\n "ptBR" : 0,\n "zht" : 1,\n "tr" : 1,\n "it" : 1,\n "zh" : 1,\n "es" : 1,\n "nl" : 0,\n "ko" : 1,\n "en" : 1,\n "fr" : 1,\n "de" : 1,\n "ja" : 1\n },\n "cert_auth" : {\n "client_cert_present" : false,\n "client_cert_valid" : false,\n "client_cert_enabled" : false,\n "use_two_factor_auth" : false,\n "client_cert_required" : false\n },\n "short_product_name" : "AMP",\n "theme" : "airwave",\n "product_name" : "AirWave Management Platform"\n}\n;\n </script>\n\n <script src="/js/mocha_app.js"></script>\n\n<!-- remove espresso later -->\n<script src="/espresso/script/module_espresso.js"></script>\n\n</head>\n<body class="fontface">\n <input type="text" name="mocha_history_state" id="mocha_history_state" style="display:none">\n\n <div id="app-container" class="off_canvas_wrap awapp-folder-breadcrumbs-parent">\n <div id="app-page-container" class="page inner_wrap">\n <awapp-folder-breadcrumbs></awapp-folder-breadcrumbs>\n <awapp-folder-breadcrumbs-menu></awapp-folder-breadcrumbs-menu>\n \n <div id="app-main-content">\n <div id="amp-nav">\n <!-- angular sidenav -->\n <awapp-sidenav></awapp-sidenav>\n </div>\n\n <div class="right-container">\n <!-- existing pages -->\n <div id="right-container">\n <!-- angular page -->\n <ng-view></ng-view>\n </div>\n </div>\n </div>\n </div>\n </div>\n <div class=\'copyright\'>\n<div>\n© 2018 Hewlett Packard Enterprise Development LP - <a target="_blank" href="http://www.arubanetworks.com/">http://www.arubanetworks.com/</a>\n<br/>\n\nNeither AirWave Wireless nor Aruba Networks, an HP company is connected, affiliated or related to Airwave Solutions Limited in any way whatsoever.\n\n</div>\n</div>\n\n\n\n</body>\n</html>\n'
>>> print r.text
答案 0 :(得分:0)
正如上午提到的requests
只是一个HTTP客户端,所以它只是在发送GET请求时废弃网站的HTML。
出于您的目的,您需要一个浏览器才能运行javascript。 硒是一个不错的选择,但不是最好的选择。
请务必查看无头浏览器列表here。