Question

我有一些具有以下结构的XML：

       <root>
           <parent-1>
              <text>blah-1</text>
              <properties>
                 <property type="R" id="0005">text-value-A</property>
                 <property type="W" id="0003">text-value-B</property>
                 <property type="H" id="0002">text-value-C</property>
                 <property type="W" id="0008">text-value-D</property>
              </properties>
           </parent-1>
           <parent-2>
              <text>blah-2</text>
              <properties>
                 <property type="W" id="0004">text-value-A</property>
                 <property type="H" id="0087">text-value-B</property>
              </properties>
           </parent-2>
           <parent-3>
              <text>blah-3</text>
              <properties>
                 <property type="H" id="0087">text-value-C</property>
                 <property type="R" id="0008">text-value-A</property>
              </properties>
           </parent-3>
           <parent-4>
              <text>blah-4</text>
              <properties>
                 <property type="H" id="0019">text-value-C</property>
                 <property type="R" id="0060">text-value-A</property>
              </properties>
           </parent-4>
       </root>

目前，我正在解析text-value-并使用某个字符串!加入它们，但对于属性级别中最后发生的text-value-X，我需要分配一些其他字符串&，并输出如下内容： text-value-A!text-value-B!text-value-C!text-value-D&text-value-A!text-value-B&text-value-C!text-value-A。

由于<property中的属性无法特定于代码/具有随机值，因此if(item.text == 'text-value-A') #get text-value-A of parent-3之类的内容将无效。

----------

我没有保留重复text-value- s（在这种情况下不需要parent-4，因为text-value-的{{1}}是完全相同的）我想保留顺序，所以与parent-3我正在做以下事情：

enumerate

鉴于上面所需的输出，我想知道我是否需要一个不同的方法解决这个问题，或者像下面这样的概念会以某种方式工作：

alist = []
for item in root.findall('parent/properties/property'):
   alist.append(item.text)
self.alist = '!'.join([a for b,a in enumerate(alist) if a not in alist[:b]]

由于

Answer 1

这可能是你想要的。

xpath公式'.//properties'生成一个包含四个元素的列表。
property_texts将包含每个文本的列表。
any谓词用于测试之前是否已经看到当前属性的文本集。如果没有，则将这些文本作为列表添加到集合中。（使用set逻辑来避免在不同的订单中丢失重复集很重要。）

from xml.etree import ElementTree tree = ElementTree.parse('bt123.xml') property_text_lists = [] for properties in tree.findall('.//properties'): property_texts = [p.text for p in properties] if any([set(property_texts)==set(ptl) for ptl in property_text_lists]): break property_text_lists.append(property_texts) print ('&'.join(['!'.join(property_text_lists[i]) for i in range(len(property_text_lists))]))

它确实产生了这个输出。

text-value-A!text-value-B!text-value-C!text-value-D&text-value-A!text-value-B&text-value-C!text-value-A

获取ElementTree中的最后一个标记并附加文本

----------

1 个答案: