使用lxml模块时出现未知错误 - 解析XML

时间:2016-11-29 20:19:33

标签: python python-3.x lxml

我正在使用Python 101学习并在一个示例中我收到错误并且不知道如何修复它 - 我的代码与书中的100%相同(已经检查过3次)它仍然输出这个错误。 这是代码:

from lxml import etree

def parseXML(xmlFile):
    """
    Parse the xml
    """
    with open(xmlFile) as fobj:
        xml = fobj.read()

    root = etree.fromstring(xml)

    for appt in root.getchildren():
        for elem in appt.getchildren():
            if not elem.text:
                text = 'None'
            else:
                text = elem.text
            print(elem.tag + ' => ' + text)

if __name__ == '__main__':
    parseXML('example.xml')

这里是xml文件(它与书中的相同):

<?xml version="1.0" ?>
<zAppointments reminder-"15">
    <appointment>
        <begin>1181251600</begin>
        <uid>0400000008200E000</uid>
        <alarmTime>1181572063</alarmTime>
        <state></state>
        <location></location>
        <duration>1800</duration>
        <subject>Bring pizza home</subject>
    </appointment>
    <appointment>
        <begin>1234567890</begin>
        <duration>1800</duration>
        <subject>Check MS office webstie for updates</subject>
        <state>dismissed</state>
        <location></location>
        <uid>502fq14-12551ss-255sf2</uid>
     </appointment>
</zAppointments>

编辑:Sry,对我的第一篇文章感到非常兴奋,我实际上忘记了错误代码。

Traceback (most recent call last):
  File "/home/michal/Desktop/nauka programowania/python 101/parsing_with_lxml.py", line 21, in <module>
    parseXML('example.xml')
  File "/home/michal/Desktop/nauka programowania/python 101/parsing_with_lxml.py", line 10, in parseXML
    root = etree.fromstring(xml)
  File "src/lxml/lxml.etree.pyx", line 3213, in lxml.etree.fromstring (src/lxml/lxml.etree.c:77737)
  File "src/lxml/parser.pxi", line 1830, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:116674)
  File "src/lxml/parser.pxi", line 1711, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:115220)
  File "src/lxml/parser.pxi", line 1051, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:109345)
  File "src/lxml/parser.pxi", line 584, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:103584)
  File "src/lxml/parser.pxi", line 694, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:105238)
  File "src/lxml/parser.pxi", line 624, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:104147)
lxml.etree.XMLSyntaxError: Specification mandate value for attribute reminder-, line 2, column 25

感谢您的帮助!!

2 个答案:

答案 0 :(得分:2)

可以在此处找到xml中唯一的错误:<zAppointments reminder-"15">,应为:<zAppointments reminder="15">

将来可以在线找到验证xml的有用工具。 例如:https://www.xmlvalidation.com/

答案 1 :(得分:0)

错误可能在

<zAppointments reminder-"15">

要进行下一次验证,请尝试使用xmllint

xmllint --valid --noout example.xml