我需要根据属性值将基于html的结构转换为xml文档。下面我提到了输入结构。
<body>
<p class='h1'>the fisr A</p>
<p class='txt'>one</p>
<p>tow</p>
<p class='h2' status='remove'></p>
<p class='h3'>the sec sec B</p>
<p class='txt'>the next text</p>
<p class='h3'>the fisr C</p>
<p class='txt'>four</p>
<p class='txt'>five</p>
<p class='h1' status="remove">the seccond A</p>
<p class='txt'>the seccond txt</p>
<p class='h2'>the second B</p>
<p class='txt'>six</p>
<p class='txt'>seven</p>
<p class='h1' status="remove">the third A</p>
<p class='txt'>eight</p>
<p class='h2' status="remove">the third A</p>
<p class='h3'>the third A</p>
<p class='txt'>the third A</p>
</body>
我的预期输出如下。在这里,我需要根据h1,h2,h3对元素进行分组。但是条件是将元素分组后,我们必须删除属性状态为值“ remove”的元素。
<book>
<sectionA>
<title>the fisr A</title>
<p xmlns="http://www.w3.org/1999/xhtml" class="txt">one</p>
<p xmlns="http://www.w3.org/1999/xhtml">tow</p>
<sectionC>
<title>the sec sec B</title>
<p xmlns="http://www.w3.org/1999/xhtml" class="txt">the next text</p>
</sectionC>
<sectionC>
<title>the fisr C</title>
<p xmlns="http://www.w3.org/1999/xhtml" class="txt">four</p>
<p xmlns="http://www.w3.org/1999/xhtml" class="txt">five</p>
</sectionC>
</sectionA>
<sectionB>
<title>the second B</title>
<p xmlns="http://www.w3.org/1999/xhtml" class="txt">six</p>
<p xmlns="http://www.w3.org/1999/xhtml" class="txt">seven</p>
</sectionB>
<sectionC>
<title>the third A</title>
<p xmlns="http://www.w3.org/1999/xhtml" class="txt">the third A</p>
</sectionC>
</book>
我尝试使用下面的xslt。我处理了变量内部的分组,然后尝试删除具有“状态”属性的标题。但这是有效的。
<xsl:template match="body">
<xsl:variable name="sequence">
<book>
<xsl:for-each-group select="p" group-starting-with="p[@class='h1']">
<sectionA>
<xsl:copy-of select="@*"></xsl:copy-of>
<title>
<xsl:value-of select="node()"/>
</title>
<xsl:for-each-group select="current-group() except ." group-starting-with="p[@class='h2']">
<xsl:choose>
<xsl:when test="self::p[@class='h2']">
<sectionB>
<xsl:copy-of select="@*"></xsl:copy-of>
<title>
<xsl:value-of select="node()"/>
</title>
<xsl:for-each-group select="current-group() except ." group-starting-with="p[@class='h3']">
<xsl:choose>
<xsl:when test="self::p[@class='h3']">
<sectionC>
<xsl:copy-of select="@*"></xsl:copy-of>
<title>
<xsl:value-of select="node()"/>
</title>
<xsl:apply-templates select="current-group() except ."></xsl:apply-templates>
</sectionC>
</xsl:when>
<xsl:otherwise>
<xsl:apply-templates select="current-group()"></xsl:apply-templates>
</xsl:otherwise>
</xsl:choose>
</xsl:for-each-group>
</sectionB>
</xsl:when>
<xsl:otherwise>
<xsl:apply-templates select="current-group()"></xsl:apply-templates>
</xsl:otherwise>
</xsl:choose>
</xsl:for-each-group>
</sectionA>
</xsl:for-each-group>
</book>
</xsl:variable>
<xsl:variable name="modifiedseq">
<xsl:apply-templates select="$sequence/node()"></xsl:apply-templates>
</xsl:variable>
<xsl:apply-templates select="$modifiedseq"></xsl:apply-templates>
</xsl:template>
<xsl:template match="p">
<xsl:copy>
<xsl:copy-of select="@*"/>
<xsl:apply-templates select="node()"/>
</xsl:copy>
</xsl:template>
分组的顺序为h1,h2,h3。假设如果在h2中存在状态status ='remove',则序列为h1,h2。请有人尝试帮助我。