Question

我整个上午一直在努力解决这个问题，而且我还没有能够让它发挥作用。

我有这样的XML（剥离匿名版本）：

<?xml version="1.0" encoding="UTF-8"?>
<Root>
  <First_Level_Node>
    <Element_Name>
      <attribute1>1</attribute1>
      <attribute2>2</attribute2>
      <attribute3>3</attribute3>
      <attribute4>4</attribute4>
      <attribute5>5</attribute5>
      <attribute6>6</attribute6>
    </Element_Name>
    <Element_Name>
      <attribute1>42</attribute1>
      <attribute2></attribute2>
      <attribute3>NO</attribute3>
      <attribute4>42</attribute4>
      <attribute5>random value</attribute5>
      <attribute6>18th Jun 2014  07:09:18 GMT</attribute6>
    </Element_Name>
    <Element_Name>
      <attribute1>42</attribute1>
      <attribute2></attribute2>
      <attribute3>NO</attribute3>
      <attribute4>42</attribute4>
      <attribute5>random</attribute5>
      <attribute6>23rd Jul 2014  02:47:10 GMT</attribute6>
    </Element_Name>
    <Element_Name>
      <attribute1>42</attribute1>
      <attribute2></attribute2>
      <attribute3>NO</attribute3>
      <attribute4>42</attribute4>
      <attribute5>random</attribute5>
      <attribute6>08th Nov 2014  23:53:31 GMT</attribute6>
    </Element_Name>
  </First_Level_Node>
</Root>

现在我已经从所有元素中获取了一些值并使用它们。

但现在我想只选择具有特定属性值对的元素。

例如在我粘贴的xml中，我只需要使用 attribute4 = 42

获取元素

我目前的代码如下：

tree=ET.parse('xmlname.xml')
root=tree.getroot()
for slot in input_data:
        for child in root[0]:
            for ch in child.findall('First Level Node/*/[@attribute4="' + str(sys.argv[1]) + '"]'):
                print ch
                if ch.tag == slot:
                    if ch.text == 'UNCOMPUTED' or ch.text == None:
                        slot_text.append("Undefined")
                    else:
                        slot_text.append(ch.text)
        data[slot]=Counter(slot_text).most_common()

但我在ch中没有得到任何价值。我尝试过多种相同的变体和我知道的所有Xpath，但仍然没有结果。

非常感谢任何帮助。

注意：Element_Name是动态的，可以更改。

编辑：试过这个，但输出错误的信息。

for slot in input_data:
        for child in root[0]:
            for ch in child:
                if ch.text == '42' and ch.tag == "attribute4":
                    flag=1
                if ch.tag == slot and flag == 1:
                    flag=0
                    if ch.text == 'UNCOMPUTED' or ch.text == None:
                        slot_text.append("Undefined")
                    else:
                        slot_text.append(ch.text)
        data[slot]=Counter(slot_text).most_common()

Answer 1

<attribute4>是XML元素，而不是XML属性。所以，我将尝试的第一件事是以下XPath：

.....
xpath = 'First Level Node/*[attribute4="' + str(sys.argv[1]) + '"]'
for ch in child.findall(xpath):
......

_{*）旁注：“第一级节点”不是有效的XML元素示例，因为它包含空格}

更新：

在您的XML示例的上下文中，child变量已指向<First_Level_Node>，<Root>的子项：

for child in root[0]:

因此，您需要从XPath中删除First_Level_Node：

.....
xpath = '*[attribute4="' + str(sys.argv[1]) + '"]'
for ch in child.findall(xpath):
......

Answer 2

试试这个：

tree=ET.parse('xmlname.xml')
root=tree.getroot()

for first_level_node in root:
    for element_name in first_level_node:
        for attribute in element_name:
            if attribute.tag == "attribute4" and attribute.text == "42":
            # do something

使用python 2.7和XML元素树选择XML中具有特定元素值的节点

2 个答案: