我有一个数据库字段,它将XML文档存储为Unicode。但是,当我获取该字段并尝试启动lxml
节点时,我收到以下错误:
node = etree.fromstring(self.xml)
ValueError: Unicode strings with encoding declaration are not supported. Please use bytes input or XML fragments without declaration.
我当前的文本(self.xml
)包含日文字符等。我将如何创建节点?
答案 0 :(得分:5)
如果你有unicode,你可以为lxml
指定utf-8解析器:
utf8_parser = etree.XMLParser(encoding='utf-8')
node = etree.fromstring(self.xml.encode('utf-8'), parser=utf8_parser)