Removing non-ASCII characters from a string? (in Python)

Asked: 2017-12-29 19:43:23

Tags: python html parsing beautifulsoup ascii

I am trying to parse strings from an HTML file that contains many lines mixing ASCII and non-ASCII characters, like this:

"industrial light & \u003cbr\u003emagic, lucasarts"

I tried using the encode function to encode the string as ASCII, but it just returns the same value I put in.

# named text rather than str, to avoid shadowing the built-in str type
text = "industrial light & \u003cbr\u003emagic, lucasarts"
text.encode('ascii', errors='ignore')
# returns "industrial light & \u003cbr\u003emagic, lucasarts"

Any help is greatly appreciated.

1 answer:

Answer 0: (score: 0)

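I found the problem. I was trying to decode it in Python 2. Python 2 and Python 3 handle this conversion differently. Once I tried it in Python 3, everything worked. Thanks for the help, everyone!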
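
For readers who hit the same thing, here is a minimal Python 3 sketch of the likely cause. In Python 2, a plain "..." literal is a byte string, so \u003c stays as six literal ASCII characters and encode('ascii') has nothing to strip. In Python 3, the same literal is a Unicode string and the escapes become < and >. The names raw and strip_non_ascii are illustrative and do not come from the original post, and the assumption that the escapes may arrive as literal backslash sequences (for example, read from a file) is mine.

```python
# Python 3: \u003c and \u003e inside a normal str literal are decoded by the parser.
text = "industrial light & \u003cbr\u003emagic, lucasarts"
assert text == "industrial light & <br>magic, lucasarts"

# Assumption: text read from a file may still hold literal backslash-u sequences.
# A raw string literal reproduces that situation here.
raw = r"industrial light & \u003cbr\u003emagic, lucasarts"

# For ASCII-only input, the unicode_escape codec turns the escapes into characters.
decoded = raw.encode("ascii").decode("unicode_escape")


def strip_non_ascii(s):
    """Drop every character outside the ASCII range (hypothetical helper)."""
    return s.encode("ascii", errors="ignore").decode("ascii")


# The accented letter and the trademark sign are removed; ASCII text is untouched.
cleaned = strip_non_ascii("lucasarts caf\u00e9 \u2122")
```

Note that < and > are themselves ASCII, so strip_non_ascii leaves the decoded <br> in place. Removing the tag itself would be a job for an HTML parser such as BeautifulSoup.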