Question

import hashlib
string1 = u'test'
hashstring = hashlib.md5()
hashstring.update(string1)
string2 = hashstring.digest()

unicode(string2)

UnicodeDecodeError: 'ascii' codec can't decode byte 0x8f in position 1: ordinal
not in range(128)

字符串是unicode，它对我有用，可以这样做吗？使用python 2.7，如果这有帮助......

Answer 1

伊格纳西奥给出了完美的答案。只是一个补充：当你将一些字符串从一个在ASCII中找不到字符的编码转换为unicode时，你必须将编码作为参数传递：

>>> unicode("órgão")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 0: ordinal not in range(128)
>>> unicode("órgão", "UTF-8")
u'\xf3rg\xe3o'

如果您不能说原始编码是什么（在我的示例中为UTF-8），您实际上无法转换为Unicode。这是一个信号，表明你的意图不太正确。

最后但并非最不重要的是，编码很混乱。这个comprehensive text about them可以说清楚。

Answer 2

.digest()的结果是bytestring¹，因此将其转换为Unicode是没有意义的。如果您想要可读的表示，请使用.hexdigest()。

¹有些字节串可以转换为Unicode，但.digest()返回的字节串不包含文本数据。它们可以包含任何字节，包括空字节：如果不使用转义序列，它们通常是不可打印的。

将hash.digest（）转换为unicode

2 个答案: