当我尝试使用urllib2发送图像时,会发生UnicodeDecodeError异常。
HTTP帖子正文:
f = open(imagepath, "rb")
binary = f.read()
mimetype, devnull = mimetypes.guess_type(urllib.pathname2url(imagepath))
body = """Content-Length: {size}
Content-Type: {mimetype}
{binary}
""".format(size=os.path.getsize(imagepath),
mimetype=mimetype,
binary=binary)
request = urllib2.Request(url, body, headers)
opener = urllib2.build_opener(urllib2.HTTPSHandler(debuglevel=1))
response = opener.open(request)
print response.read()
追溯:
response = opener.open(request)
File "/usr/local/lib/python2.7/urllib2.py", line 404, in open
response = self._open(req, data)
File "/usr/local/lib/python2.7/urllib2.py", line 422, in _open
'_open', req)
File "/usr/local/lib/python2.7/urllib2.py", line 382, in _call_chain
result = func(*args)
File "/usr/local/lib/python2.7/urllib2.py", line 1222, in https_open
return self.do_open(httplib.HTTPSConnection, req)
File "/usr/local/lib/python2.7/urllib2.py", line 1181, in do_open
h.request(req.get_method(), req.get_selector(), req.data, headers)
File "/usr/local/lib/python2.7/httplib.py", line 973, in request
self._send_request(method, url, body, headers)
File "/usr/local/lib/python2.7/httplib.py", line 1007, in _send_request
self.endheaders(body)
File "/usr/local/lib/python2.7/httplib.py", line 969, in endheaders
self._send_output(message_body)
File "/usr/local/lib/python2.7/httplib.py", line 827, in _send_output
msg += message_body
File "/home/usertmp/biogeek/lib/python2.7/encodings/utf_8.py", line 16, in decode
return codecs.utf_8_decode(input, errors, True)
UnicodeDecodeError: 'utf8' codec can't decode byte 0xff in position 49: invalid start byte
python版本2.7.5
任何人都知道解决方案吗?
答案 0 :(得分:2)
您正在尝试发送包含标题和内容的正文。如果要发送内容类型和内容长度,则需要在标题中进行,而不是在正文中:
headers = {'Content-Type': mimetype, 'Content-Length', str(size)}
request = urllib2.Request(url, data=binary, headers=headers)
如果您未设置Content-Length标头,则会自动将其设置为data
关于你的错误:它发生在行
msg += message_body
如果这两个字符串中的一个是unicode
,而另一个str
包含\xff
,则只会发生此错误,因为在这种情况下,后者将自动同步为unicode sys.getdefaultencoding()
。
我的最终猜测是:message_body
这是data
,str
,某处包含\xff
。 msg
是先前传递给HTTPConnection的内容,即头文件,它们是unicode,因为你在头文件中至少使用了一个密钥的unicode(之前的值转换为str
),或者您已从unicode_literals
导入__futures__
。