Question

基本上我向一个网站提出了一个请求并得到了一个字节响应：b'[{"geonameId:"703448"}..........'.我很困惑，因为虽然它是字节类型，但它非常易读，看起来像是一个json列表。我知道响应是在运行r.encoding的latin1中编码的，它返回ISO-859-1并且我试图解码它，但它只返回一个空字符串。这是我到目前为止所做的：

r = response.content
string = r.decode("ISO-8859-1")
print (string)

这是打印空行的地方。但是当我跑步时

len(string)

我得到：回31023 如何在不返回空字符串的情况下解码这些字节？

Answer 1

另一个解决方案是使用response.text，它以unicode

返回内容

Type:        property
String form: <property object at 0x7f76f8c79db8>
Docstring:  
Content of the response, in unicode.

If Response.encoding is None, encoding will be guessed using
``chardet``.

The encoding of the response content is determined based solely on HTTP
headers, following RFC 2616 to the letter. If you can take advantage of
non-HTTP knowledge to make a better guess at the encoding, you should
set ``r.encoding`` appropriately before accessing this property.

Answer 2

您是否尝试使用json模块解析它？

import json
parsed = json.loads(response.content)

Answer 3

有r.text和r.content。第一个是字符串，第二个是字节。

你想要

import json

data = json.loads(r.text)

请求返回字节，我无法解码它们

3 个答案: