Question

我想读取xml文件的最后一个条目并获取其值。这是我的xml文件

<TestSuite>
  <TestCase>
    <name>tcname1</name>
    <total>1</total>
    <totalpass>0</totalpass>
    <totalfail>0</totalfail>
    <totalerror>1</totalerror>
  </TestCase>
  <TestCase>
    <name>tcname2</name>
    <total>1</total>
    <totalpass>0</totalpass>
    <totalfail>0</totalfail>
    <totalerror>1</totalerror>
  </TestCase>
</TestSuite>

我想在文件的最后一个标记中获取<total>，<totalpass>，<totalfail>和<totalerror>值。我已经尝试过这段代码。

import xmltodict
with open(filename) as fd:
    doc = xmltodict.parse(fd.read())
    length=len(doc['TestSuite']['TestCase'])
    tp=doc['TestSuite']['TestCase'][length-1]['totalpass']
    tf=doc['TestSuite']['TestCase'][length-1]['totalfail']
    te=doc['TestSuite']['TestCase'][length-1]['totalerror']
    total=doc['TestSuite']['TestCase'][length-1]['total']

这适用于xml文件中包含2个或更多测试用例标记的xml，但对于只有一个测试用例标记的文件会出现此错误。

Traceback (most recent call last):
  File "HTMLReportGenerationFromXML.py", line 52, in <module>
    tp=doc['TestSuite']['TestCase'][length-1]['totalpass']
KeyError: 4 .

因为它不是计数，而是取子标签（等值作为长度）。请帮我解决这个问题。

Answer 1

为什么我不首先做他的事！使用xpath。

第一个示例涉及仅使用一个TestCase元素处理xml文件，第二个示例包含两个TestCase元素。关键是使用xpath last选择器。

>>> from lxml import etree
>>> tree = etree.parse('temp.xml')
>>> last_TestCase = tree.xpath('.//TestCase[last()]')[0]
>>> for child in last_TestCase.iterchildren():
...     child.tag, child.text
... 
('name', 'tcname2')
('total', '1')
('totalpass', '0')
('totalfail', '0')
('totalerror', '1')
>>> 
>>> tree = etree.parse('temp_2.xml')
>>> last_TestCase = tree.xpath('.//TestCase[last()]')[0]
>>> for child in last_TestCase.iterchildren():
...     child.tag, child.text
... 
('name', 'tcname1')
('reason', 'reason')
('total', '2')
('totalpass', '0')
('totalfail', '0')
('totalerror', '2')

Answer 2

由于您只想要最后一个，您可以使用负索引来检索它：

import xml.etree.ElementTree as et

tree = et.parse('test.xml')

# collect all the test cases
test_cases = [test_case for test_case in tree.findall('TestCase')]

# Pull data from the last one
last = test_cases[-1]
total = last.find('total').text
totalpass = last.find('totalpass').text
totalfail = last.find('totalfail').text
totalerror = last.find('totalerror').text

print total,totalpass,totalfail,totalerror

Answer 3

您的错误原因在于<PropertyGroup Label="Globals"> xmltidict是一个仅适用于长XML的列表

doc['TestSuite']['TestCase']

但它只是一个单项长文件的字典：

>>> type(doc2['TestSuite']['TestCase']) # here doc2 is more than one-entry long XML file 
>>> list

这就是原因。您可以尝试以下列方式管理问题：

>>> type(doc['TestSuite']['TestCase']) # doc is one-entry long 
>>> collections.OrderedDict

否则，您可以使用另一个库进行XML解析。

...让我知道它是否有帮助！

Answer 4

我试过这个对我有用

import xml.etree.ElementTree as ET
import sys
tree = ET.parse('temp.xml')
root = tree.getroot()
print root
total=[]
totalpass=[]
totalfail=[]
totalerror=[]
for test in root.findall('TestCase'):
    total.append(test.find('total').text)
    totalpass.append(test.find('totalpass').text)
    totalfail.append(test.find('totalfail').text)
    totalerror.append(test.find('totalerror').text)
length=len(total)
print total[length-1],totalpass[length-1],totalfail[length-1],totalerror[length-1]

这个适用于我

XML在python中读取文件中的最后一个条目

4 个答案: