Question

我正在使用ElementTree和Python来解析XML文件以找到子项的内容

这是我试图解析的XML文件：

<?xml version='1.0' encoding='UTF-8'?>
<nvd xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://nvd.nist.gov/feeds/cve/1.2" nvd_xml_version="1.2" pub_date="2016-02-10" xsi:schemaLocation="http://nvd.nist.gov/feeds/cve/1.2  http://nvd.nist.gov/schema/nvdcve_1.2.1.xsd">
  <entry type="CVE" name="CVE-1999-0001" seq="1999-0001" published="1999-12-30" modified="2010-12-16" severity="Medium" CVSS_version="2.0" CVSS_score="5.0" CVSS_base_score="5.0" CVSS_impact_subscore="2.9" CVSS_exploit_subscore="10.0" CVSS_vector="(AV:N/AC:L/Au:N/C:N/I:N/A:P)">
    <desc>
      <descript source="cve">ip_input.c in BSD-derived TCP/IP implementations allows remote attackers to cause a denial of service (crash or hang) via crafted packets.</descript>
    </desc>
    <loss_types>
      <avail/>
    </loss_types>
    <range>
      <network/>
    </range>
    <refs>
      <ref source="OSVDB" url="http://www.osvdb.org/5707">5707</ref>
      <ref source="CONFIRM" url="http://www.openbsd.org/errata23.html#tcpfix">http://www.openbsd.org/errata23.html#tcpfix</ref>
    </refs>

这是我的代码：

import xml.etree.ElementTree as ET

if __name__ == '__main__':
  tree = ET.parse('nvdcve-modified.xml')
  root = tree.getroot()

  print root.find('entry')
  print root[0].find('desc')

无线

中两行的输出

Answer 1

您的XML在根元素级别定义了默认名称空间：

xmlns="http://nvd.nist.gov/feeds/cve/1.2"

没有前缀的后代元素隐式地继承祖先的默认命名空间。要在命名空间中查找元素，可以将前缀映射到命名空间URI并使用前缀，如下所示：

ns = {'d': 'http://nvd.nist.gov/feeds/cve/1.2'}
root.find('d:entry', ns)

或直接使用名称空间URI：

root.find('{http://nvd.nist.gov/feeds/cve/1.2}entry')

ElementTree find（）始终返回None

1 个答案: