Question

我想在Java中使用XPath读取XML数据。

我有下一个名为MyXML.xml的XML文件：

<?xml version="1.0" encoding="iso-8859-1" ?>
<REPOSITORY xmlns:LIBRARY="http://www.openarchives.org/LIBRARY/2.0/"
            xmlns:xsi="http://www.w3.prg/2001/XMLSchema-instance"
            xsi:schemaLocation="http://www.openarchives.org/LIBRARY/2.0/ http://www.openarchives.org/LIBRARY/2.0/LIBRARY-PHM.xsd">
    <repository>Test</repository>
    <records>
        <record>
            <ejemplar>
                <library_book:book
                        xmlns:library_book="http://www.w3c.es/LIBRARY/book/"
                        xmlns:book="http://www.w3c.es/LIBRARY/book/"
                        xmlns:bookAssets="http://www.w3c.es/LIBRARY/book/"
                        xmlns:bookAsset="http://www.w3c.es/LIBRARY/book/"
                        xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
                        xsi:schemaLocation="http://www.w3c.es/LIBRARY/book/ http://www.w3c.es/LIBRARY/replacement/book.xsd">
                    <book:bookAssets count="1">
                        <book:bookAsset nasset="1">
                            <book:bookAsset.id>value1</book:bookAsset.id>
                            <book:bookAsset.event>
                                <book:bookAsset.event.id>value2</book:bookAsset.event.id>
                            </book:bookAsset.event>
                        </book:bookAsset>
                    </book:bookAssets>
                </library_book:book>
            </ejemplar>
        </record>
    </records>
</REPOSITORY>

我想要访问value1和value2值。为此，我试试这个：

// Standard of reading a XML file
DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
factory.setNamespaceAware(true);
DocumentBuilder builder;
Document doc = null;
XPathExpression expr = null;
builder = factory.newDocumentBuilder();
doc = builder.parse("MyXML.xml");

// Create a XPathFactory
XPathFactory xFactory = XPathFactory.newInstance();

// Create a XPath object
XPath xpath = xFactory.newXPath();

expr = xpath.compile("//REPOSITORY/records/record/ejemplar/library_book:book//book:bookAsset.event.id/text()");

Object result = expr.evaluate(doc, XPathConstants.STRING);

System.out.println("RESULT=" + (String)result);

但我没有得到任何结果。仅打印RESULT=。

¿如何访问value1和value2值？ ¿要应用的XPath过滤器是什么？。

先谢谢。

我正在使用JDK6。

Answer 1

您遇到名称空间问题，您可以做的是

考虑到它们
使用XPath local-name()函数

解决方案1意味着实现映射名称空间名称和URI的NamespaceContext，并在查询之前将其设置在XPath对象上。

解决方案2很简单，您只需要更改XPath（但根据您的XML，您可以对您的XPath进行微调以确保选择正确的元素）：

XPath xpath = xFactory.newXPath();
expr = xpath.compile("//*[local-name()='bookAsset.event.id']/text()");
Object result = expr.evaluate(doc, XPathConstants.STRING);
System.out.println("RESULT=" + result);

Runnable example on ideone

您可以查看following blog article以更好地理解Java中命名空间和XPath的用法（即使是旧的）

Answer 2

尝试

Object result = expr.evaluate(doc, XPathConstants.NODESET);

    // Cast the result to a DOM NodeList
    NodeList nodes = (NodeList) result;
    for (int i=0; i<nodes.getLength();i++){
      System.out.println(nodes.item(i).getNodeValue());
    }

Answer 3

一种方法是实现名称空间上下文，如：

public static class UniversalNamespaceResolver implements NamespaceContext {
    private Document sourceDocument;


    public UniversalNamespaceResolver(Document document) {
        sourceDocument = document;
    }


    public String getNamespaceURI(String prefix) {
        if (prefix.equals(XMLConstants.DEFAULT_NS_PREFIX)) {
            return sourceDocument.lookupNamespaceURI(null);
        } else {
            return sourceDocument.lookupNamespaceURI(prefix);
        }
    }


    public String getPrefix(String namespaceURI) {
        return sourceDocument.lookupPrefix(namespaceURI);
    }

    public Iterator getPrefixes(String namespaceURI) {
        return null;
    }

}

然后像

一样使用它

        xpath.setNamespaceContext(new UniversalNamespaceResolver(doc));

您还需要将所有名称空间声明向上移动到根节点（REPOSITORY）。否则，如果在两个不同的级别上有名称空间声明，则可能会出现问题。

如何使用Java中的XPath访问值读取XML

3 个答案: