如何简化我的代码-提取相同xml标记名称的所有节点值

时间:2018-11-30 08:41:49

标签: python xml minidom

例如,这是XML数据:

   <SOAP-ENV:Body>
     <reportList>
        <reportName>report 1</reportName>
     </reportList>
     <reportList>
        <reportName>report 2</reportName>
     </reportList>
     <reportList>
        <reportName>report 3</reportName>
     </reportList>
   </SOAP-ENV:Body>

这是我的代码,用于提取所有 reportName 的节点值,并且有效。

import xml.dom.minidom
...
node = xml.dom.minimom.parseString(xml_file.text).documentElement
reportLists = node.getElementsByTagName('reportList')

reports = []
for reportList in reportLists:
    reportObj = reportList.getElementsByTagName('reportName')[0]
    reports.append(reportObj)

for report in reports:
    nodes = report.childNodes
    for node in nodes:
        if node.nodeType == node.TEXT_NODE:
            print (node.data)

结果:

report 1
report 2
report 3

尽管可以,但是我想简化代码。如何使用较短的代码获得相同的结果?

1 个答案:

答案 0 :(得分:0)

您可以使用列表推导来简化两个for循环:

import xml.dom.minidom
node = xml.dom.minidom.parseString(xml_file.text).documentElement
reportLists = node.getElementsByTagName('reportList')

reports = [report.getElementsByTagName('reportName')[0] for report in reportLists]
node_data = [node.data for report in reports for node in report.childNodes if node.nodeType == node.TEXT_NODE]

node_data现在是一个包含您正在打印的信息的列表。