合并多个xml文件时,如何为父xml以及子xml文件设置EntityResolver?

时间:2019-03-24 15:47:51

标签: xslt entityresolver

我有一个参考其他xml文件的书xml文件。当我尝试在book.xml文件上运行xslt时,代码中的EntityResolver解析了dtd路径。但是,对于正在合并的子级xml文件,无法解析dtd路径。

样本sample_book.ditamap

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE bookmap PUBLIC "-//OASIS//DTD DITA BookMap//EN" "bookmap.dtd">
<bookmap>
    <booktitle>
        <mainbooktitle>sample book</mainbooktitle>
    </booktitle>
    <part navtitle="Overview">
        <topicref href="../topics/introduction.dita"/>
        <topicref href="../topics/install.dita"/>
    </part>
</bookmap>

`

Java代码

public class XMLProcessor  {

       public void transform(String xmlf, String xslf) throws TransformerConfigurationException, TransformerException, org.xml.sax.SAXException, IOException{

        org.xml.sax.XMLReader reader = XMLReaderFactory.createXMLReader();
        Transformer transformer;
        TransformerFactory factory = TransformerFactory.newInstance();

         StreamSource stylesheet = new StreamSource(xslf);

         //Source source = StreamSource(xmlf);
         //SAXSource source = new SAXSource(new InputSource(xmlf));
        // org.xml.sax.XMLReader reader = XMLReaderFactory.createXMLReader();

         EntityResolver ent = new EntityResolver() {

            @Override
            public InputSource resolveEntity(String publicId, String systemId) throws SAXException, IOException {

                System.out.println(publicId);
                System.out.println(systemId);

                if(publicId.equals("-//OASIS//DTD DITA BookMap//EN")){
                    return new InputSource("file:///D:/sample/dtd/bookmap.dtd");
                }

                return null;
                }
            };
        // sour.setPublicId("file:///D:/sample/dtd/bookmap/dtd/bookmap.dtd");


            reader.setEntityResolver(ent);
            SAXSource source = new SAXSource(reader, new InputSource(xmlf));
            //reader.parse(new InputSource(xmlf));
         //StreamSource sourcedoc = new StreamSource(xmlf);
         transformer = factory.newTransformer(stylesheet);

        try {
            transformer.transform(source, new StreamResult(new FileWriter("D:\\sample\\out\\result.xml")));
        } catch (IOException ex) {
            Logger.getLogger(XMLProcessor.class.getName()).log(Level.SEVERE, null, ex);
        }

    }





}`

预期结果

`

<?xml version="1.0" encoding="UTF-8"?>
<pages xmlns:mf="urn:mf">
    <parentpage>
        <parentpagename>sample book</parentpagename>
    </parentpage>
    <part>
        <partname>Overview</partname>
        <text/>
        <level-2>
            <concept xmlns:ditaarch="http://dita.oasis-open.org/architecture/2005/"
                id="introduction" ditaarch:DITAArchVersion="1.3"
                domains="(topic concept)                            (topic abbrev-d)                            a(props deliveryTarget)                            (topic equation-d)                            (topic hazard-d)                            (topic hi-d)                            (topic indexing-d)                            (topic markup-d)                            (topic mathml-d)                            (topic pr-d)                            (topic relmgmt-d)                            (topic sw-d)                            (topic svg-d)                            (topic ui-d)                            (topic ut-d)                            (topic markup-d xml-d)   "
                class="- topic/topic concept/concept ">
                <title class="- topic/title ">Introduction</title>
                <shortdesc class="- topic/shortdesc "/>
                <conbody class="- topic/body  concept/conbody ">
                    <p class="- topic/p ">Sample introduction</p>
                </conbody>
            </concept>
        </level-2>
        <text/>
        <level-2>
            <task xmlns:ditaarch="http://dita.oasis-open.org/architecture/2005/" id="install"
                ditaarch:DITAArchVersion="1.3"
                domains="(topic task)                            (topic abbrev-d)                            a(props deliveryTarget)                            (topic equation-d)                            (topic hazard-d)                            (topic hi-d)                            (topic indexing-d)                            (topic markup-d)                            (topic mathml-d)                            (topic pr-d)                            (topic relmgmt-d)                            (topic sw-d)                            (topic svg-d)                            (topic ui-d)                            (topic ut-d)                            (topic markup-d xml-d)                            (topic task strictTaskbody-c)   "
                class="- topic/topic task/task ">
                <title class="- topic/title ">Install</title>
                <shortdesc class="- topic/shortdesc "/>
                <taskbody class="- topic/body task/taskbody ">
                    <context class="- topic/section task/context ">
                        <p class="- topic/p ">Download xyz installer from here. </p>
                    </context>
                    <steps class="- topic/ol task/steps ">
                        <step class="- topic/li task/step ">
                            <cmd class="- topic/ph task/cmd ">Double-click the downloader
                                installer.</cmd>
                        </step>
                        <step class="- topic/li task/step ">
                            <cmd class="- topic/ph task/cmd ">Do this.</cmd>
                        </step>
                        <step class="- topic/li task/step ">
                            <cmd class="- topic/ph task/cmd ">Do that</cmd>
                        </step>
                    </steps>
                </taskbody>
            </task>
        </level-2>
    </part>
</pages>

`

实际结果

运行XSLT时,显示以下错误消息。当我将dtd文件移动到主题文件夹时,错误消失了。

Warning   XTDE0540: Ambiguous rule match for /bookmap/booktitle[1] Matches both "element(Q{}booktitle)" on line 54 of
>file:///D:/sample/xsl/merge.xsl and "element(Q{}booktitle)" on line 18 of >file:///D:/sample/xsl/merge.xsl Warning at char 11 in
     

xsl:apply-templates / @ select> merge.xsl的第30行:   FODC0002:XML解析器处理报告的I / O错误   文件:/ D:/sample/sampledoc/topics/introduction.dita:   D:\ sample \ sampledoc \ topics \ concept.dtd(系统找不到

     
    已指定

文件)在xsl:apply-templates / @ select的char 11处发出警告,此行> merge.xsl的第104列:FODC0002:I / O错误     XML解析器处理文件:/ D:/sample/sampledoc/topics/install.dita:     D:\ sample \ sampledoc \ topics \ task.dtd(系统找不到> file     指定)

  

2 个答案:

答案 0 :(得分:1)

您可以在Transformer上设置URIResolver,当您的XSLT代码调用doc()或document()来获取引用的XML文件时,将调用URIResolver。然后,URIResolver可以在用于解析这些文件的XML解析器上设置EntityResolver。

或者,您可以使用Apache XMLResolver来完成所有这些工作,该Apache XMLResolver通过引用OASIS定义的格式的目录文件在XSLT和XML级别上同时引用URI。

答案 1 :(得分:0)

感谢迈克尔,我可以解决此问题。

  1. 在类路径中添加了CatalogManager.properties文件。

  2. 使用所有公共ID创建了catalog.xml。

    public class XMLProcessor  {
    
       public void transform(String xmlf, String xslf, String outpath) throws TransformerConfigurationException, TransformerException, org.xml.sax.SAXException, IOException{
    
        org.xml.sax.XMLReader reader = XMLReaderFactory.createXMLReader();
        Transformer transformer = null;
        TransformerFactory factory = TransformerFactory.newInstance();
    
         StreamSource stylesheet = new StreamSource(xslf);
    
            CatalogResolver cr = new CatalogResolver();
            reader.setEntityResolver(cr);
            factory.setURIResolver(cr);
    
            SAXSource source = new SAXSource(reader, new InputSource(xmlf));
    
         transformer = factory.newTransformer(stylesheet);
    
        try {
            transformer.transform(source, new StreamResult(new FileWriter(outpath)));
        } catch (IOException ex) {
            Logger.getLogger(XMLProcessor.class.getName()).log(Level.SEVERE, null, ex);
        }
    
    }
    

    }