Question

我在python中写蜘蛛。我得到了一个列表，其中包含一个元素[u'\xb9\xd8\xd3\xda\xbf\xaa\xd5\xb9]，它是GBK代码＆＃34;关于开展＆＃34;。我尝试过一些方法，但都没有。

Answer 1

通常，编码字符串为str，解码为unicode。你得到的编码unicode是由错误的解码引起的。您可以通过str将其转换回encode('latin1')，然后按GBK解码：

>>> text = u'\xb9\xd8\xd3\xda\xbf\xaa\xd5\xb9'
>>> text = text.encode('latin1')
>>> text
'\xb9\xd8\xd3\xda\xbf\xaa\xd5\xb9'
>>> text = text.decode('gbk')
>>> text
u'\u5173\u4e8e\u5f00\u5c55'

然后你可以打印出来。

如何在python中打印GBK编码的单词？

1 个答案: