我正试图绕过某个没有API的服务并决定尝试使用Mechanize(我通常使用urllib)。
如何为一个open
电话添加特定标头?
或者有没有办法构建一个带有自己标头的Request实例,然后让我的mechanize.Browser
实例处理它?</ p>
browser = mechanize.Browser()
headers = [
('Accept', 'text/javascript, text/html, application/xml, text/xml, */*'),
('Content-type', 'application/x-www-form-urlencoded; charset=UTF-8'),
('User-Agent', 'Foobar'),
]
browser.addheaders = headers
# log in, do stuff, etc.
# here, for this one browser request, I need to add an AJAX header
browser.open('/a_url_to_ajax_post/', urllib.urlencode({'foo': 'bar'}))
我的解决方法是暂时修改addheaders列表,但哇哇哇哇!
browser.addheaders.append(AJAX_HEADER)
browser.open('/admin/discounts', urllib.urlencode(pulled_params))
browser.addheaders.pop()
答案 0 :(得分:7)
这样做:
import mechanize
import urllib2
browser = mechanize.Browser()
# setup your header, add anything you want
header = {'User-Agent': 'Mozilla/5.0 (Windows NT 5.1; rv:14.0) Gecko/20100101 Firefox/14.0.1', 'Referer': 'http://whateveritis.com'}
url = "http://google.com"
# wrap the request. You can replace None with the needed data if it's a POST request
request = urllib2.Request(url, None, header)
# here you go
response = browser.open(request)
print response.geturl()
print response.read()
response.close()
答案 1 :(得分:1)
您可以使用python with
语句。做一个这样的课:
class ExtraHeaders(object):
def __init__(self, br, headers):
self.extra_headers = headers
self.br = br
def __enter__(self):
self.old_headers = self.br.addheaders
self.br.addheaders = self.extra_headers + [h for h in self.br.addheaders if
not reduce(
lambda accum, ex_h: accum or ex_h[0] == h[0],self.extra_headers,False)]
return self.br
def __exit__(self, type, value, traceback):
self.br.addheaders = self.old_headers
然后以这种方式使用它:
with ExtraHeaders(browser, [AJAX_HEADER]):
browser.open('/admin/discounts', urllib.urlencode(pulled_params))
#requests beyond this point won't have AJAX_HEADER
请注意,如果你是多线程的,那么访问浏览器的任何线程,而另一个线程在with语句中,也会有额外的头文件。