Google App Engine上的间歇性DownloadError应用程序错误2

时间:2010-08-13 15:06:24

标签: python google-app-engine urllib2

我们有两个应用程序都在Google App Engine上运行。 App1向app2发出请求,作为经过身份验证的用户。通过从Google ClientLogin请求与cookie交换的身份验证令牌来进行身份验证。然后将cookie用于后续请求(如here所述)。 App1运行以下代码:

class AuthConnection:

    def __init__(self):            
        self.cookie_jar = cookielib.CookieJar()    
        self.opener = urllib2.OpenerDirector()
        self.opener.add_handler(urllib2.ProxyHandler())
        self.opener.add_handler(urllib2.UnknownHandler())
        self.opener.add_handler(urllib2.HTTPHandler())
        self.opener.add_handler(urllib2.HTTPRedirectHandler())
        self.opener.add_handler(urllib2.HTTPDefaultErrorHandler())
        self.opener.add_handler(urllib2.HTTPSHandler())
        self.opener.add_handler(urllib2.HTTPErrorProcessor())
        self.opener.add_handler(urllib2.HTTPCookieProcessor(self.cookie_jar))
        self.headers = {'User-Agent': 'Mozilla/5.0 (Windows; U; ' +\
                                       'Windows NT 6.1; en-US; rv:1.9.1.2) ' +\
                                       'Gecko/20090729 Firefox/3.5.2 ' +\
                                       '(.NET CLR 3.5.30729)'
                        }

    def fetch(self, url, method, payload=None):
        self.__updateJar(url)
        request = urllib2.Request(url)
        request.get_method = lambda: method
        for key, value in self.headers.iteritems():
            request.add_header(key, value)
        response = self.opener.open(request)
        return response.read()

    def __updateJar(self, url):

        cache = memcache.Client()
        cookie = cache.get('auth_cookie')

        if cookie:
            self.cookie_jar.set_cookie(cookie)
        else:
            cookie = self.__retrieveCookie(url=url)
            cache.set('auth_cookie', cookie, 5000)  


    def __getCookie(self, url):
        auth_url = 'https://www.google.com/accounts/ClientLogin'
        auth_data = urllib.urlencode({'Email': USER_NAME,
                                      'Passwd': PASSPHRASE,
                                      'service': 'ah',
                                      'source':  'app1',
                                      'accountType': 'HOSTED_OR_GOOGLE' })
        auth_request = urllib2.Request(auth_url, data=auth_data)
        auth_response_body = self.opener.open(auth_request).read()
        auth_response_dict = dict(x.split('=') 
                for x in auth_response_body.split('\n') if x)
        cookie_args = {}
        cookie_args['continue'] = url
        cookie_args['auth'] = auth_response_dict['Auth']
        cookie_url = 'https://%s/_ah/login?%s' %\
                ('app2.appspot.com', (urllib.urlencode(cookie_args)))
        cookie_request = urllib2.Request(cookie_url)

        for key, value in self.headers.iteritems():
            cookie_request.add_header(key, value)

        try:
            self.opener.open(cookie_request)
        except:
            pass

        for cookie in self.cookie_jar:                         
            if cookie.domain == 'app2domain':
                return cookie

对于10-30%的请求,会引发DownloadError:

Error fetching https://app2/Resource
Traceback (most recent call last):
  File "/base/data/home/apps/app1/5.344034030246386521/source/main/connection/authenticate.py", line 112, in fetch
    response = self.opener.open(request)
  File "/base/python_runtime/python_dist/lib/python2.5/urllib2.py", line 381, in open
    response = self._open(req, data)
  File "/base/python_runtime/python_dist/lib/python2.5/urllib2.py", line 399, in _open
    '_open', req)
  File "/base/python_runtime/python_dist/lib/python2.5/urllib2.py", line 360, in _call_chain
    result = func(*args)
  File "/base/python_runtime/python_dist/lib/python2.5/urllib2.py", line 1115, in https_open
    return self.do_open(httplib.HTTPSConnection, req)
  File "/base/python_runtime/python_dist/lib/python2.5/urllib2.py", line 1080, in do_open
    r = h.getresponse()
  File "/base/python_runtime/python_dist/lib/python2.5/httplib.py", line 197, in getresponse
    self._allow_truncated, self._follow_redirects)
  File "/base/data/home/apps/app1/5.344034030246386521/source/main/connection/monkeypatch_urlfetch_deadline.py", line 18, in new_fetch
    follow_redirects, deadline, *args, **kwargs)
  File "/base/python_runtime/python_lib/versions/1/google/appengine/api/urlfetch.py", line 241, in fetch
    return rpc.get_result()
  File "/base/python_runtime/python_lib/versions/1/google/appengine/api/apiproxy_stub_map.py", line 501, in get_result
    return self.__get_result_hook(self)
  File "/base/python_runtime/python_lib/versions/1/google/appengine/api/urlfetch.py", line 325, in _get_fetch_result
    raise DownloadError(str(err))
DownloadError: ApplicationError: 2 

app2(“服务器”)的请求日志似乎正常,正如预期的那样(根据文档DownloadError仅在没有有效的HTTP响应时才会引发)。

为什么会引发异常?

1 个答案:

答案 0 :(得分:3)

看到这个: http://bitbucket.org/guilin/gae-rproxy/src/tip/gae_rproxy/niceurllib.py

因为urllib和urllib2默认处理http 302代码,并自动重定向到服务器告诉它的内容。但是当重定向时,它不包含服务器告诉它的cookie。

例如:

  1. urllib2 request // server / login
  2. 服务器响应302, // server / profile,set-cookie: 会话id:XXXX
  3. urllib2请求 //服务器/简档
  4. 服务器响应未登录错误或 500错误导致没有 session-id found。
  5. urllib2 throw error
  6. 所以,你没有机会设置cookie。

    self.opener.add_handler(urllib2.HTTPRedirectHandler中())

    我认为你应该删除这一行并添加你自己的HTTPRedirectHandler,它更新的错误也不会自动重定向,只需返回http代码和标题,这样你就有机会设置cookie。