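Applying XSLT to XML to filter information based on tag matches within the same file

Asked: 2014-09-26 17:24:14

Tags: xml xslt

I am new to XML and XSLT, and I want to filter some information out of an XML file based on matching values of certain tags within that same file.

Here is my XML file: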

<?xml version="1.0" encoding="UTF-8"?>
<People>
<Person>
    <required-tag1>some-information</required-tag1>
    <required-tag2>some-information</required-tag2>
    <tag3>not important info</tag3>
    <tag4>not important info</tag4>
    <first-name>Mike</first-name>
    <last-name>Hewitt</last-name>
    <licenses>
        <license>
            <number>938387</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">TX</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
        <license>
            <number>938387</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">IL</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
    </licenses>
    <appointments>
        <appointment-info>
            <code>5124</code>
            <number>14920329324</number>
            <licensed-states>
                <state>TX</state>
            </licensed-states>
        </appointment-info>
    </appointments>
</Person>
<Person>
    <required-tag1>some-information</required-tag1>
    <required-tag2>some-information</required-tag2>
    <tag3>not important info</tag3>
    <tag4>not important info</tag4>
    <first-name>John</first-name>
    <last-name>Jhonny</last-name>
    <licenses>
        <license>
            <number>1762539</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">TX</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
        <license>
            <number>1762539</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">NY</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
    </licenses>
    <appointments>
        <appointment-info>
            <code>5124</code>
            <number>14920329324</number>
            <licensed-states>
                <state>TX</state>
            </licensed-states>
        </appointment-info>
    </appointments>
</Person>
<Person>
    <required-tag1>some-information</required-tag1>
    <required-tag2>some-information</required-tag2>
    <tag3>not important info</tag3>
    <tag4>not important info</tag4>
    <first-name>Mike</first-name>
    <last-name>Hewitt</last-name>
    <licenses>
        <license>
            <number>17294083</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">IL</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
    </licenses>
    <appointments>
        <appointment-info>
            <code>5124</code>
            <number>14920329324</number>
            <licensed-states>
                <state>IL</state>
            </licensed-states>
        </appointment-info>
    </appointments>
</Person>
<Person>
    <required-tag1>some-information</required-tag1>
    <required-tag2>some-information</required-tag2>
    <tag3>not important info</tag3>
    <tag4>not important info</tag4>
    <first-name>John</first-name>
    <last-name>Jhonny</last-name>
    <licenses>
        <license>
            <number>840790</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">TX</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
        <license>
            <number>840790</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">NY</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
        <license>
            <number>840790</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">CA</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
    </licenses>
    <appointments>
        <appointment-info>
            <code>5124</code>
            <number>14920329324</number>
            <licensed-states>
                <state>TX</state>
                <state>NY</state>
            </licensed-states>
        </appointment-info>
    </appointments>
</Person>
</People>

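What I basically want to do is this: if a person is licensed in a state (e.g. TX) and also has appointment information for that same state (TX), filter that license out. If that was the person's only license, filter out the whole person.

The new XML should contain the information from the required tags, plus only those licenses whose state does not match any state listed under the appointment's licensed-states. A person whose licenses all match is filtered out: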

<?xml version="1.0" encoding="UTF-8"?>
<People>
<Person>
    <required-tag1>some-information</required-tag1>
    <required-tag2>some-information</required-tag2>
    <first-name>Mike</first-name>
    <last-name>Hewitt</last-name>
    <licenses>
        <license>
            <number>938387</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">IL</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
    </licenses>
</Person>
<Person>
    <required-tag1>some-information</required-tag1>
    <required-tag2>some-information</required-tag2>
    <first-name>John</first-name>
    <last-name>Jhonny</last-name>
    <licenses>
        <license>
            <number>1762539</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">NY</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
    </licenses>
</Person>
<Person>
    <required-tag1>some-information</required-tag1>
    <required-tag2>some-information</required-tag2>
    <first-name>John</first-name>
    <last-name>Jhonny</last-name>
    <licenses>
        <license>
            <number>840790</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">CA</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
        </license>
    </licenses>
</Person>
</People>

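How can I write an XSLT to filter this information? I am using XSLT version 1.0.

At the moment I can apply the following XSLT to get the required tags into the transformed output, but I don't know how to filter the licenses by state: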

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:output encoding="UTF-8" indent="yes" method="xml"/>
<xsl:strip-space elements="*"/>
<xsl:template match="/People">
  <People>
  <xsl:apply-templates select="Person"/>
  </People>
 </xsl:template>
 <xsl:template match="Person">
   <Person>
  <xsl:copy-of select="required-tag1"/>
  <xsl:copy-of select="required-tag2"/>
  <xsl:copy-of select="first-name"/>
  <xsl:copy-of select="last-name"/>
</Person>
</xsl:template>
</xsl:stylesheet>

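1 Answer:

Answer 0: (score: 1)

As with most XSLT, start with an identity transform and then override it.

You can filter out the licenses by overriding the identity transform, with an empty template, for only those license elements that have a state matching a state in licensed-states.

XML Input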

<People>
    <Person>
        <required-tag1>some-information</required-tag1>
        <required-tag2>some-information</required-tag2>
        <tag3>not important info</tag3>
        <tag4>not important info</tag4>
        <first-name>Mike</first-name>
        <last-name>Hewitt</last-name>
        <licenses>
            <license>
                <number>938387</number>
                <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">TX</state>
                <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
            </license>
            <license>
                <number>938387</number>
                <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">IL</state>
                <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
            </license>
        </licenses>
        <appointments>
            <appointment-info>
                <code>5124</code>
                <number>14920329324</number>
                <licensed-states>
                    <state>TX</state>
                </licensed-states>
            </appointment-info>
        </appointments>
    </Person>
    <Person>
        <required-tag1>some-information</required-tag1>
        <required-tag2>some-information</required-tag2>
        <tag3>not important info</tag3>
        <tag4>not important info</tag4>
        <first-name>John</first-name>
        <last-name>Jhonny</last-name>
        <licenses>
            <license>
                <number>1762539</number>
                <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">TX</state>
                <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
            </license>
            <license>
                <number>1762539</number>
                <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">NY</state>
                <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
            </license>
        </licenses>
        <appointments>
            <appointment-info>
                <code>5124</code>
                <number>14920329324</number>
                <licensed-states>
                    <state>TX</state>
                </licensed-states>
            </appointment-info>
        </appointments>
    </Person>
    <Person>
        <required-tag1>some-information</required-tag1>
        <required-tag2>some-information</required-tag2>
        <tag3>not important info</tag3>
        <tag4>not important info</tag4>
        <first-name>Mike</first-name>
        <last-name>Hewitt</last-name>
        <licenses>
            <license>
                <number>17294083</number>
                <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">IL</state>
                <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
            </license>
        </licenses>
        <appointments>
            <appointment-info>
                <code>5124</code>
                <number>14920329324</number>
                <licensed-states>
                    <state>IL</state>
                </licensed-states>
            </appointment-info>
        </appointments>
    </Person>
    <Person>
        <required-tag1>some-information</required-tag1>
        <required-tag2>some-information</required-tag2>
        <tag3>not important info</tag3>
        <tag4>not important info</tag4>
        <first-name>John</first-name>
        <last-name>Jhonny</last-name>
        <licenses>
            <license>
                <number>840790</number>
                <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">TX</state>
                <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
            </license>
            <license>
                <number>840790</number>
                <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">NY</state>
                <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
            </license>
            <license>
                <number>840790</number>
                <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">CA</state>
                <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
            </license>
        </licenses>
        <appointments>
            <appointment-info>
                <code>5124</code>
                <number>14920329324</number>
                <licensed-states>
                    <state>TX</state>
                    <state>NY</state>
                </licensed-states>
            </appointment-info>
        </appointments>
    </Person>
</People>

XSLT 1.0

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
    <xsl:output indent="yes"/>
    <xsl:strip-space elements="*"/>

    <!--Identity transform (aka identity template). This will match
    and copy attributes and nodes (element, text, comment and
    processing-instruction) without changing them. Unless a more
    specific template matches, everything will get handled by this
    template.-->
    <xsl:template match="@*|node()">
        <xsl:copy>
            <xsl:apply-templates select="@*|node()"/>
        </xsl:copy>
    </xsl:template>

    <!--This template will match all "appointments", "tag3", "tag4" element nodes.
    It will also match "license" element nodes that have a child "state"
    element whose value matches a "state" element node that is a child of 
    "licensed-states".
    It will also match the "Person" element node if the number of
    "license" elements whose "state" has no corresponding
    "licensed-states/state" is equal to zero. ("filtered person who matched
    all licenses" requirement.)
    Instead of writing 5 individual xsl:templates, I used
    the union "|" operator in the "match" attribute. Since the "xsl:template" is 
    empty, nothing is output or processed further.-->
    <xsl:template match="appointments|license[state=../..//licensed-states/state]|tag3|
    tag4|Person[count(licenses/license[not(state=../..//licensed-states/state)])=0]"/>

</xsl:stylesheet>

XML Output

<People>
   <Person>
      <required-tag1>some-information</required-tag1>
      <required-tag2>some-information</required-tag2>
      <first-name>Mike</first-name>
      <last-name>Hewitt</last-name>
      <licenses>
         <license>
            <number>938387</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">IL</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
         </license>
      </licenses>
   </Person>
   <Person>
      <required-tag1>some-information</required-tag1>
      <required-tag2>some-information</required-tag2>
      <first-name>John</first-name>
      <last-name>Jhonny</last-name>
      <licenses>
         <license>
            <number>1762539</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">NY</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
         </license>
      </licenses>
   </Person>
   <Person>
      <required-tag1>some-information</required-tag1>
      <required-tag2>some-information</required-tag2>
      <first-name>John</first-name>
      <last-name>Jhonny</last-name>
      <licenses>
         <license>
            <number>840790</number>
            <state xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">CA</state>
            <field xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">Health</field>
         </license>
      </licenses>
   </Person>
</People>
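
For readability, the single union pattern above can also be split into one empty template per concern. The following is only a sketch under that assumption, not part of the original answer: it swaps `../..` for `ancestor::Person` and `count(...)=0` for `not(...)`, which should select the same nodes for this input and therefore produce the same output.

```xml
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
    <xsl:output indent="yes"/>
    <xsl:strip-space elements="*"/>

    <!-- Identity transform: copy every node unless a template below overrides it. -->
    <xsl:template match="@*|node()">
        <xsl:copy>
            <xsl:apply-templates select="@*|node()"/>
        </xsl:copy>
    </xsl:template>

    <!-- Elements that are never wanted in the output. -->
    <xsl:template match="tag3|tag4|appointments"/>

    <!-- Drop a license whose state equals any state listed under
         licensed-states of the same Person. -->
    <xsl:template match="license[state = ancestor::Person//licensed-states/state]"/>

    <!-- Drop a Person that has no license left once the matching licenses
         are removed (this also drops a Person with no license at all). -->
    <xsl:template match="Person[not(licenses/license[not(state = ancestor::Person//licensed-states/state)])]"/>

</xsl:stylesheet>
```

Variables cannot be referenced in an XSLT 1.0 match pattern, which is why the state comparison is repeated in both predicates rather than factored out.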

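If you end up filtering out more nodes than you keep, you can switch to an xsl:apply-templates that selects only the nodes you want to keep, and leave the remaining filtering to templates...

XSLT 1.0 (same output as above)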

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
    <xsl:output indent="yes"/>
    <xsl:strip-space elements="*"/>

    <!--Identity transform (aka identity template). This will match
    and copy attributes and nodes (element, text, comment and
    processing-instruction) without changing them. Unless a more
    specific template matches, everything will get handled by this
    template.-->    
    <xsl:template match="@*|node()">
        <xsl:copy>
            <xsl:apply-templates select="@*|node()"/>
        </xsl:copy>
    </xsl:template>

    <!--This template will match the "Person" element node. The "xsl:copy"
    creates the new "Person" element. The "xsl:apply-templates" tells
    the processor to apply templates to any attributes (of Person) or
    elements listed in the "select". (Other elements will not be 
    processed.) I used the union operator in the "select" so I wouldn't
    have to write multiple "xsl:apply-templates".-->
    <xsl:template match="Person">
        <xsl:copy>
            <xsl:apply-templates select="@*|first-name|last-name|
                required-tag1|required-tag2|licenses"/>
        </xsl:copy>
    </xsl:template>

    <!--This template will match any "license" element nodes that have a child 
    "state" element whose value matches a "state" element node that is a 
    child of "licensed-states". 
    This template will also match the "Person" element node if the number of
    "license" elements whose "state" has no corresponding
    "licensed-states/state" is equal to zero. ("filtered person who matched
    all licenses" requirement.)
    Since the "xsl:template" is empty, nothing 
    is output or processed further.-->
    <xsl:template match="license[state=../..//licensed-states/state]|
    Person[count(licenses/license[not(state=../..//licensed-states/state)])=0]"/>

</xsl:stylesheet>
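
Note that this second stylesheet has two templates that can match the same `Person` element. It works because of the XSLT 1.0 default priorities: a bare element name such as `Person` gets priority 0, while a pattern with a predicate such as `Person[...]` gets 0.5, so the empty template wins whenever its predicate is true. If you would rather not rely on the defaults, a sketch like the following states the intent explicitly. The `priority="1"` value is an arbitrary choice for illustration; any value above 0 would behave the same here.

```xml
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
    <xsl:output indent="yes"/>
    <xsl:strip-space elements="*"/>

    <!-- Identity transform (default priority -0.5 for node()). -->
    <xsl:template match="@*|node()">
        <xsl:copy>
            <xsl:apply-templates select="@*|node()"/>
        </xsl:copy>
    </xsl:template>

    <!-- Keep only the wanted children of Person (default priority 0). -->
    <xsl:template match="Person">
        <xsl:copy>
            <xsl:apply-templates select="@*|first-name|last-name|
                required-tag1|required-tag2|licenses"/>
        </xsl:copy>
    </xsl:template>

    <!-- Drop licenses whose state is listed under licensed-states. -->
    <xsl:template match="license[state=../..//licensed-states/state]"/>

    <!-- Drop a Person whose licenses all match. The explicit priority makes
         this template win over the plain "Person" template above. -->
    <xsl:template priority="1"
        match="Person[count(licenses/license[not(state=../..//licensed-states/state)])=0]"/>

</xsl:stylesheet>
```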