Python etree XSLT是否需要Tag输出?

时间:2016-03-19 22:19:39

标签: python xml csv xslt elementtree

我正在尝试制作一个简单的XML - > CSV脚本,使用XSLT。我发现etree似乎“想要”标签输出......有没有人知道一个解决方法?是的,我看过这篇文章:XML to CSV Using XSLT

见下文......

这是一个仅供参考的XML数据示例。我的代码甚至没有对数据做任何事情,因为它甚至没有写入标题。

    <projects>
    <project>
    <name>Shockwave</name> 
    <language>Ruby</language> 
    <owner>Brian May</owner> 
    <state>New</state> 
    <startDate>31/10/2008 0:00:00</startDate> 
    </project>
    <project>
    <name>Other</name> 
    <language>Erlang</language> 
    <owner>Takashi Miike</owner> 
    <state> Canceled </state> 
    <startDate>07/11/2008 0:00:00</startDate> 
    </project>
    </projects>

这是我的剧本:

    import sys
    from lxml import etree

    system_file = sys.argv[1]
    xml_file = sys.argv[2]

    sys_txt = open( system_file,"r" ).read()
    xsl_txt = open( "csv_file.xslt","r" ).read()


    sysroot = etree.fromstring( sys_txt )
    xslroot = etree.fromstring( xsl_txt )
    transform = etree.XSLT( xslroot )

    with open( xml_file, "w" ) as f:
        f.write(etree.tostring( transform(sysroot) ) )

此XSLT代码不起作用(etree.tostring ... = None):

    <xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
    <xsl:output method="text"/>
    <xsl:template match="/">
    Hi
    </xsl:template>

    </xsl:stylesheet>

但是这个XSLT确实有用......似乎etree需要输出一个XML文件?

    <xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
    <xsl:output method="text"/>
    <xsl:template match="/">
    <dummy>
    Hi
    </dummy>
    </xsl:template>

    </xsl:stylesheet>

此时我想我可以继续使用虚拟标签,然后在结束时删除它......

1 个答案:

答案 0 :(得分:1)

  

“Python etree XSLT需要标记输出吗?”

答案是

the documentation中所示, XSLT结果对象部分;你可以使用标准的python str()函数来获得转换结果的预期字符串表示,特别是当它没有根元素时:

from lxml import etree

raw_xml = '''<projects> 
  <project> 
    <name>Shockwave</name>  
    <language>Ruby</language>  
    <owner>Brian May</owner>  
    <state>New</state>  
    <startDate>31/10/2008 0:00:00</startDate> 
  </project>  
  <project> 
    <name>Other</name>  
    <language>Erlang</language>  
    <owner>Takashi Miike</owner>  
    <state>Canceled</state>  
    <startDate>07/11/2008 0:00:00</startDate> 
  </project> 
</projects>'''
raw_xslt = '''<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="1.0">  
  <xsl:output method="text"/>  
  <xsl:template match="/"> 
    <xsl:text>Hi</xsl:text> 
  </xsl:template> 
</xsl:stylesheet>'''

sysroot = etree.fromstring(raw_xml)
xslroot = etree.fromstring(raw_xslt)
transform = etree.XSLT(xslroot)

print str(transform(sysroot))
# output:
# Hi

正如您所见,当转换结果具有根元素时,etree.tostring()仍然可用。