我有一个看起来像这样的XML:
<?xml version="1.0" encoding="ISO-8859-1" standalone="yes"?>
<body>
<t id="1" word="w<E4>re"/>
</body>
"w<E4>re"
是德语单词“wäre”。当我尝试使用python lxml解析此xml时,即使我应用encoding =“ iso-8859-1”,我也只会得到“ w”而不是完整的单词:
from lxml import etree as ET
for event, elem in ET.iterparse("myXML.xml", recover=True, encoding="iso-8859-1"):
if elem.tag == 't':
print(elem.attrib['word'])
如何获得“警告”?