使用Java在文档中的任何位置定位XML元素

时间:2015-10-28 21:09:08

标签: java xml xpath

给出以下XML(示例):

<?xml version="1.0" encoding="UTF-8"?>
<rsb:VersionInfo xmlns:atom="http://www.w3.org/2005/Atom" xmlns:rsb="http://ws.rsb.de/v2">
    <rsb:Variant>Windows</rsb:Variant>
    <rsb:Version>10</rsb:Version>
</rsb:VersionInfo>

我需要获取VariantVersion的值。我目前的方法是使用XPath,因为我不能依赖给定的结构。我所知道的是文档中某处有一个元素rsb:Version

XPath xpath = XPathFactory.newInstance().newXPath();
String expression = "//Variant";
InputSource inputSource = new InputSource("test.xml");
String result = (String) xpath.evaluate(expression, inputSource, XPathConstants.STRING);
System.out.println(result);

然而,这不输出任何东西。我尝试了以下XPath表达式:

  • //变
  • //变体/文本()
  • // RSB:变体
  • // RSB:变体/文本()

什么是正确的XPath表达式?或者是否有更简单的方法来获得这个元素?

1 个答案:

答案 0 :(得分:3)

我建议只循环浏览文档以找到给定的标记

public static void main(String[] args) throws SAXException, IOException,ParserConfigurationException, TransformerException {

    DocumentBuilderFactory docBuilderFactory = DocumentBuilderFactory
            .newInstance();
    DocumentBuilder docBuilder = docBuilderFactory.newDocumentBuilder();
    Document document = docBuilder.parse(new File("test.xml"));

    NodeList nodeList = document.getElementsByTagName("rsb:VersionInfo");
    for (int i = 0; i < nodeList.getLength(); i++) {
        Node node = nodeList.item(i);
        if (node.getNodeType() == Node.ELEMENT_NODE) {
            // do something with the current element
            System.out.println(node.getNodeName());
        }
    }
}

编辑:Yassin指出它不会获得子节点。这应该指出你正确的方向让孩子们。

private static List<Node> getChildren(Node n)
  {
    List<Node> children = asList(n.getChildNodes());
    Iterator<Node> it = children.iterator();
    while (it.hasNext())
      if (it.next().getNodeType() != Node.ELEMENT_NODE)
        it.remove();
    return children;
  }