应用错误收集

当我在python 3.4中使用urllib.request获取页面时，我得到一个以

开头的字节流

b'\x1f\x8b\x08\x00\x00\x00\x00\x00\x00\x03\xecY'

我无法识别字符编码。出现以下错误：

sample.decode('utf-8')
---------------------------------------------------------------------------
UnicodeDecodeError                        Traceback (most recent call last)
<ipython-input-16-1a4f8f09934e> in <module>()
----> 1 sample.decode('utf-8')

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position 1: invalid start byte

使用的网站是：http://indianrail.gov.in

有人可以帮忙吗？

无法识别字符编码

0 个答案: