Question

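I receive HTML files that contain escaped character references standing in for characters such as “ and ü (for example, `&#252;` for ü).

I need them to be readable. I could use str.replace() for each one, but isn't there a Python 3 package or library that already knows all the character codes and can handle this?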

Answer 1

You can use html.unescape():

import html
print(html.unescape('&quot;&#252;'))
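
As a small sketch of why this beats a chain of str.replace() calls: html.unescape() decodes named, decimal, and hexadecimal references in one call. The sample strings below are made up for illustration and are not taken from the asker's files.

```python
import html

# The same text written with three kinds of character reference.
# These sample strings are illustrative assumptions, not real input data.
named = html.unescape("&uuml; &ldquo;quoted&rdquo;")
decimal = html.unescape("&#252; &#8220;quoted&#8221;")
hexadecimal = html.unescape("&#xFC; &#x201C;quoted&#x201D;")

print(named)
print(decimal)
print(hexadecimal)

# Text without any references passes through unchanged.
plain = html.unescape("plain text")
print(plain)
```

All three calls should yield the same readable string, so there is no need to maintain a replacement table by hand.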

Answer 2

See the solution here. The operation is called decoding (or unescaping), and yes, there is a library for it.
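
The linked solution is not reproduced in this thread, so the following is only a sketch, assuming the standard library's html.entities module is the kind of "library that knows the character codes" meant here. It shows the lookup tables that the standard library ships with:

```python
from html.entities import html5, name2codepoint

# name2codepoint maps an entity name (without & and ;) to its code point.
uuml_codepoint = name2codepoint["uuml"]
print(uuml_codepoint, chr(uuml_codepoint))

# html5 maps HTML5 entity names (with the trailing ;) to the character itself.
uuml_char = html5["uuml;"]
print(uuml_char)

# The left double quotation mark is covered as well.
ldquo_char = chr(name2codepoint["ldquo"])
print(ldquo_char)
```

In everyday use these tables rarely need to be consulted directly, since html.unescape() handles the decoding.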