unicode的未知编码

时间:2010-11-09 01:19:25

标签: python unicode

这个编码的名称是什么?

  

SMB:// NAS /音乐/ _lib / V / VOIVOD / VOIVOD-RRR%C3%B6%C3%B6%C3%B6aaarrr / 01%20-%20voivod%20-%20rrr%C3%B6%C3 %B6%C3%B6aaarrr%20-%20korg%C3%BCll_the_exterminator.mp3

我想使用Python将这样的字符串转换为unicode。我该怎么做?

3 个答案:

答案 0 :(得分:3)

这是URL编码的UTF-8。对其进行URL解码,然后将其解码为UTF-8。

答案 1 :(得分:0)

尝试urllib.unquote()。

答案 2 :(得分:0)

import urllib

>>> m="smb://nas/music/_lib/v/voivod/voivod-rrr%C3%B6%C3%B6%C3%B6aaarrr/01%20-%20voivod%20-%20rrr%C3%B6%C3%B6%C3%B6aaarrr%20-%20korg%C3%BCll_the_exterminator.mp3"
>>> m1= urllib.unquote(m)
>>> print(repr(m))
'smb://nas/music/_lib/v/voivod/voivod-rrr\xc3\xb6\xc3\xb6\xc3\xb6aaarrr/01 - voivod - rrr\xc3\xb6\xc3\xb6\xc3\xb6aaarrr - korg\xc3\xbcll_the_exterminator.mp3'
>>> m2= m1.decode('utf8')
>>> print(m2)
smb://nas/music/_lib/v/voivod/voivod-rrröööaaarrr/01 - voivod - rrröööaaarrr - korgüll_the_exterminator.mp3