Python Mechanize - 如何在单个.open()调用中添加标头?

时间:2012-08-30 20:46:46

标签: python mechanize

我正试图绕过某个没有API的服务并决定尝试使用Mechanize(我通常使用urllib)。

如何为一个open电话添加特定标头?

或者有没有办法构建一个带有自己标头的Request实例,然后让我的mechanize.Browser实例处理它?<​​/ p>

browser = mechanize.Browser()
headers = [
    ('Accept', 'text/javascript, text/html, application/xml, text/xml, */*'),
    ('Content-type', 'application/x-www-form-urlencoded; charset=UTF-8'),
    ('User-Agent', 'Foobar'),
]

browser.addheaders = headers
# log in, do stuff, etc.

# here, for this one browser request, I need to add an AJAX header
browser.open('/a_url_to_ajax_post/', urllib.urlencode({'foo': 'bar'}))

我的解决方法是暂时修改addheaders列表,但哇哇哇哇!

browser.addheaders.append(AJAX_HEADER)
browser.open('/admin/discounts', urllib.urlencode(pulled_params))
browser.addheaders.pop()

2 个答案:

答案 0 :(得分:7)

这样做:

import mechanize
import urllib2

browser = mechanize.Browser()

# setup your header, add anything you want
header = {'User-Agent': 'Mozilla/5.0 (Windows NT 5.1; rv:14.0) Gecko/20100101 Firefox/14.0.1', 'Referer': 'http://whateveritis.com'}
url = "http://google.com"

# wrap the request. You can replace None with the needed data if it's a POST request
request = urllib2.Request(url, None, header)

# here you go
response = browser.open(request)

print response.geturl()
print response.read()
response.close()

答案 1 :(得分:1)

您可以使用python with语句。做一个这样的课:

class ExtraHeaders(object):
    def __init__(self, br, headers):
        self.extra_headers = headers
        self.br = br
    def __enter__(self):
        self.old_headers = self.br.addheaders
        self.br.addheaders = self.extra_headers + [h for h in self.br.addheaders if 
            not reduce(
                lambda accum, ex_h: accum or ex_h[0] == h[0],self.extra_headers,False)]
        return self.br
    def __exit__(self, type, value, traceback):
        self.br.addheaders = self.old_headers

然后以这种方式使用它:

with ExtraHeaders(browser, [AJAX_HEADER]):
    browser.open('/admin/discounts', urllib.urlencode(pulled_params))
#requests beyond this point won't have AJAX_HEADER

请注意,如果你是多线程的,那么访问浏览器的任何线程,而另一个线程在with语句中,也会有额外的头文件。