Question

我找不到任何东西，我不能使用BeautifulSoap。我有一个XML URL文档。我想解析所有称为Contenido的项目，但它有atribs，我不知道如何获得它们。我尝试过使用xml.sax，但我不知道如何在attrib

中使用它

<contenido>
<tipo>Evento</tipo>
<atributos idioma="es">
<atributo nombre="ID-EVENTO">8006941</atributo>
<atributo nombre="TITULO">582 mapas, Compañía Teatral “La Cola del Pavo”</atributo>
<atributo nombre="TIPO">
/contenido/actividades/RecitalesPresentacionesActosLiterarios
</atributo>
</atributos>
</contenido>

Answer 1

这是使用xml.sax

的示例

import xml.sax

class MyHandler( xml.sax.ContentHandler ):

    def __init__(self):
        self.is_atributo = False

    def startElement(self, tag, attributes):
       if tag == 'atributo':
           self.is_atributo = True

    def characters(self, content):
        if self.is_atributo:
            print(content)

if __name__ == '__main__':
    parser = xml.sax.make_parser()
    parser.setFeature(xml.sax.handler.feature_namespaces, 0)

    Handler = MyHandler()
    parser.setContentHandler(Handler)
    parser.parse('your.xml')

Python XML解析了这个doc

1 个答案: