Question

I have this XML file, and I'd like to get the country nodes that have the pattern 'Pana' in their names.

<?xml version="1.0"?>
<data>
    <country name="Liechtenstein">
        <rank>1</rank>
        <year>2008</year>
        <gdppc>141100</gdppc>
        <neighbor name="Austria" direction="E"/>
        <neighbor name="Switzerland" direction="W"/>
    </country>
    <country name="Singapore">
        <rank>4</rank>
        <year>2011</year>
        <gdppc>59900</gdppc>
        <neighbor name="Malaysia" direction="N"/>
    </country>
    <country name="Panama">
        <rank>68</rank>
        <year>2011</year>
        <gdppc>13600</gdppc>
        <neighbor name="Costa Rica" direction="W"/>
        <neighbor name="Colombia" direction="E"/>
    </country>
</data>

I tried this:

    import xml.etree.ElementTree as ET
    tree = ET.parse('test.xml')
    root = tree.getroot()
    list=root.find(".//country[contains(@name, 'Pana')]")

But I get an error: SyntaxError: invalid predicate

Can anyone help with this?

Answer 1

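I can't say exactly why the original code fails, but the XPath expression itself is not at fault. It is valid XPath, and with lxml's xpath() the leading . can safely be omitted.

Have you tried the lxml xpath() method?

Because this: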

from lxml import etree

tree = etree.parse('etree.xml')
root = tree.getroot()
# lxml's xpath() supports XPath 1.0 functions such as contains()
matches = root.xpath("//country[contains(@name,'Pana')]")

print(matches[0].tag)

returns the country element:

$ python test.py
country

Answer 2

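The XML library you are using, xml.etree.ElementTree, does not support contains(). You will need to use a different parser to get full XPath support: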

https://docs.python.org/2/library/xml.etree.elementtree.html#elementtree-xpath
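
To make the limitation concrete, here is a minimal sketch using only the standard library. The inline XML is a trimmed, hypothetical stand-in for the file in the question, and the variable names are my own. It shows that an exact attribute match is accepted while the contains() predicate is rejected with a SyntaxError:

```python
import xml.etree.ElementTree as ET

# Trimmed stand-in for the question's test.xml (assumption: only names matter here).
XML = """<data>
    <country name="Liechtenstein"/>
    <country name="Singapore"/>
    <country name="Panama"/>
</data>"""

root = ET.fromstring(XML)

# An exact attribute match is part of ElementTree's supported XPath subset.
exact = root.find(".//country[@name='Panama']")

# contains() is not part of that subset, so the predicate is rejected.
try:
    root.find(".//country[contains(@name, 'Pana')]")
    contains_error = None
except SyntaxError as exc:
    contains_error = str(exc)

print(exact.get("name"))
print(contains_error)
```

So the error in the question comes from ElementTree's XPath subset, not from a malformed expression.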

Answer 3

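xml.etree.ElementTree provides only limited support for XPath expressions for locating elements in a tree, and that subset does not include the XPath contains() function. See the documentation for the list of supported XPath syntax.

You need to either resort to a library that provides better XPath support, such as lxml, or use a simpler XPath and do the further filtering manually, for example: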

import xml.etree.ElementTree as ET

tree = ET.parse('test.xml')
root = tree.getroot()
# Select every country that has a name attribute, then filter by substring in Python
matches = [c for c in root.findall(".//country[@name]") if 'Pana' in c.get('name')]

How to search for an XML node with a given attribute value in Python
