尝试使用mechanize登录quora

时间:2013-07-04 13:28:32

标签: python http mechanize quora

我正在尝试使用mechanize模块登录quora.com。

这是我的代码:

#!/usr/bin/env python

import mechanize
import cookielib

br = mechanize.Browser() # create a browser object
br.set_handle_equiv(True)
br.set_handle_redirect(True)
br.set_handle_referer(True)
br.set_handle_robots(False)
br.set_headers = [('User-Agent', 'Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/537.36 (KHTML, like Gecko)')]

cj = cookielib.LWPCookieJar()
br.set_cookiejar(cj)

res = br.open('http://www.quora.com')
br.select_form(nr = 0)

br.form['email'] = 'uuuu'
br.form['password'] = 'pppp'

res = br.submit()
print res.read()

我收到此错误:

Traceback (most recent call last):
  File "mech.py", line 29, in <module>
    res = br.submit()
  File "/usr/local/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 541, in submit
    return self.open(self.click(*args, **kwds))
  File "/usr/local/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 203, in open
    return self._mech_open(url, data, timeout=timeout)
  File "/usr/local/lib/python2.7/dist-packages/mechanize/_mechanize.py", line 255, in _mech_open
    raise response
mechanize._response.httperror_seek_wrapper: HTTP Error 500: Internal Server Error

我知道HTTP 500意味着服务器端出现了问题,但我不知道它是什么,以及如何调试问题。

1 个答案:

答案 0 :(得分:5)

我打赌mechanize无法处理提交此特定表单。使用浏览器开发人员工具,您可以看到此表单是通过javascript XHR POST请求提交给https://www.quora.com/webnode2/server_call_POST?__instart__的,其中请求参数如下所示:

json:{"args":[],"kwargs":{"email":"<email>","password":"<password>","passwordless":1}}
formkey:62c4f0d88246bfd81b27cf0dca410d75
window_id:dep4-4597603286175583039
_lm_transaction_id:0.4317954108119011
_lm_window_id:dep4-4597603286175583039
__vcon_json:["hmac","t1cKg1QhQsYPCA"]
__vcon_method:do_login
js_init:{}

仅供参考,如果您通过br.set_debug_http(True)启用调试,您将看到幕后的脚本通过以下参数向主页http://www.quora.com发出POST请求:

email=<email>
allow_passwordless=allow_passwordless
password=<password>

如您所见:错误的网址和不同的参数。

对于这种特殊情况,我会使用selenium

希望有所帮助。