If I try:
url = "https://www.economist.com/news/finance-and-economics/21727073-economists-struggle-work-out-how-much-free-economy-comes-cost"
{:ok, %HTTPoison.Response{status_code: 200, body: body}} = HTTPoison.get(url)
IO.binwrite body
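I see garbled text in the console instead of HTML. But if I view the page source in a browser, I see HTML there. What am I doing wrong?
PS: It works with a JS HTTP client (axios.js); I don't know why it doesn't work with HTTPoison.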
Answer 0 (score: 5)
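That URL returns the body gzip-compressed and indicates this by sending the header Content-Encoding: gzip. hackney, the library HTTPoison is built on, does not decode this automatically. This feature will likely be added at some point. Until then, you can decode the body yourself with the :zlib module if the Content-Encoding is gzip: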
url = "https://www.economist.com/news/finance-and-economics/21727073-economists-struggle-work-out-how-much-free-economy-comes-cost"
{:ok, %HTTPoison.Response{status_code: 200, headers: headers, body: body}} = HTTPoison.get(url)
gzip? = Enum.any?(headers, fn {name, value} ->
  # Headers are case-insensitive so we compare their lower case form.
  :hackney_bstr.to_lower(name) == "content-encoding" &&
    :hackney_bstr.to_lower(value) == "gzip"
end)
body = if gzip?, do: :zlib.gunzip(body), else: body
IO.write body
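
The same check can be tried without any network access. The following is only a minimal sketch: BodyDecoder is a hypothetical helper name (it is not part of HTTPoison or hackney), the headers are assumed to be a list of {name, value} string tuples as in the response above, and String.downcase stands in for :hackney_bstr.to_lower so the script runs with no dependencies. It compresses a small string with :zlib.gzip to stand in for the server's response.

```elixir
defmodule BodyDecoder do
  # Hypothetical helper for illustration; not part of HTTPoison or hackney.
  # Gunzips the body only when the headers say it is gzip-encoded.
  def decode(headers, body) do
    if gzip?(headers), do: :zlib.gunzip(body), else: body
  end

  # Header names and values are compared in lower case.
  defp gzip?(headers) do
    Enum.any?(headers, fn {name, value} ->
      String.downcase(name) == "content-encoding" and
        String.downcase(value) == "gzip"
    end)
  end
end

html = "<html><body>hello</body></html>"

# Stand-in for a gzip-compressed response body.
compressed = :zlib.gzip(html)

# A gzip stream starts with the magic bytes 0x1F 0x8B, which is why
# printing it raw looks like garbage. This match fails otherwise.
<<0x1F, 0x8B, _rest::binary>> = compressed

# With the gzip header the body is decompressed; without it, it is untouched.
^html = BodyDecoder.decode([{"Content-Encoding", "gzip"}], compressed)
^html = BodyDecoder.decode([{"Content-Type", "text/html"}], html)
```

Saved as a .exs file, this runs with the elixir command. With a real response, the headers and body from HTTPoison.get would be passed to BodyDecoder.decode in the same way.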