Question

I am trying to decode a string that has ASCII character codes mixed in with ordinary text.

For example:

g&#108bo&#115w&#111&#114t&#104

But I am not getting the output I expect:

'g&#108bo&#115w&#111&#114t&#104'.decode("ascii")

Output:

u'g&#108bo&#115w&#111&#114t&#104'

If I strip the &# characters and try just the integers, I get this:

>>> chr(108)
'l'
>>> chr(115)
's'
>>> chr(111)
'o'
>>> chr(114)
'r'
>>> chr(104)
'h'

Expected output:

glbosworth

How can I decode "g&#108bo&#115w&#111&#114t&#104" into the expected output?

Answer 1

You are trying to decode an HTML-escaped string. You can do this with the html.unescape(s) function (on Python 3):

import html
print(html.unescape('g&#108bo&#115w&#111&#114t&#104'))

Output:

glbosworth

See this SO answer for more information.
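To tie this back to the chr() calls in the question, here is a minimal sketch of doing the same substitution by hand with a regular expression. The helper name decode_numeric_refs is hypothetical, and it only handles decimal references such as &#108 (with or without a trailing semicolon); hex references and named entities like &amp; are left untouched, so html.unescape is still the better choice in real code.

```python
import re

def decode_numeric_refs(text):
    # Hypothetical helper, for illustration only.
    # Replace each decimal reference such as "&#108" (optional trailing ";")
    # with the character that chr() returns for that code point.
    return re.sub(r"&#(\d+);?", lambda m: chr(int(m.group(1))), text)

print(decode_numeric_refs("g&#108bo&#115w&#111&#114t&#104"))  # glbosworth
```

This is written for Python 3. On Python 2, chr() only accepts values below 256, so unichr() would be needed for anything outside that range.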

Answer 2

In Python 3, you can use html.unescape:

import html
print(html.unescape('g&#108bo&#115w&#111&#114t&#104'))

In Python 2, you can use HTMLParser:

from HTMLParser import HTMLParser
h = HTMLParser()
print(h.unescape('g&#108bo&#115w&#111&#114t&#104'))
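If the same script has to run on both versions, one possible sketch is to try the Python 3 import first and fall back to the Python 2 module. The alias unescape is just a local name chosen for this example. Note that HTMLParser.unescape was removed from Python 3 in 3.9, so the fallback branch is only meant for Python 2.

```python
try:
    # Python 3.4 and later
    from html import unescape
except ImportError:
    # Python 2 fallback: the module is named HTMLParser there
    from HTMLParser import HTMLParser
    unescape = HTMLParser().unescape

decoded = unescape('g&#108bo&#115w&#111&#114t&#104')
print(decoded)
```

On Python 3 this prints glbosworth. I believe the Python 2 implementation only matches references that end in a semicolon, so check its result on this particular input before relying on it.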