Question

我正在尝试阅读xml文件，例如：

<entry>
    <title>FEED TITLE</title>
    <id>5467sdad98787ad3149878sasda</id>
    <tempi type="application/xml">
      <conento xmlns="http://mydomainname.com/xsd/radiofeed.xsd" madeIn="USA" />
    </tempi>
</entry>

这是我到目前为止的代码：

以下是我尝试编写此代码的尝试，该怎么说不成功，这就是为什么我开始赏金。这是http://pastebin.com/huKP4KED。

Bounty更新：

我真的真的试过这几天，现在没想到会这么辛苦，我会接受有用的链接/书籍/教程，但更喜欢代码，因为我昨天需要这样做。

这就是我需要的：

关于上面的xml：

我需要获得title，id
tempi的属性值以及contento的madeIn属性值

这样做的最佳方式是什么？

编辑：

@Pascal Thivent

也许创建方法是个好主意，比如public String getValue（String xml，Element elementname），你指定标签名称，方法返回标签值或标签属性（如果值是，则可以将其命名为附加方法参数）不可用

如果标签值不可用，我真的想获得某些标签值或属性，所以我正在考虑这样做的最佳方法是什么，因为我之前从未这样做过< / p>

Answer 1

最好的解决方案是使用XPath。你的pastebin过期了，但这就是我收集的内容。假设我们有以下feed.xml文件：

<?xml version="1.0" encoding="UTF-8" ?>
<entries>
<entry>
    <title>FEED TITLE 1</title>
    <id>id1</id>
    <tempi type="type1">
      <conento xmlns="dontcare?" madeIn="MadeIn1" />
    </tempi>
</entry>
<entry>
    <title>FEED TITLE 2</title>
    <id>id2</id>
    <tempi type="type2">
      <conento xmlns="dontcare?" madeIn="MadeIn2" />
    </tempi>
</entry>
<entry>
    <id>id3</id>
</entry>
</entries>

这是一个简短但可编译且可运行的概念验证（在同一目录中有feed.xml文件）。

import javax.xml.xpath.*;
import javax.xml.parsers.*;
import org.w3c.dom.*;
import java.io.*;
import java.util.*;

public class XPathTest {
    static class Entry {
        final String title, id, origin, type;
        Entry(String title, String id, String origin, String type) {
            this.title = title;
            this.id = id;
            this.origin = origin;
            this.type = type;
        }
        @Override public String toString() {
            return String.format("%s:%s(%s)[%s]", id, title, origin, type);
        }
    }

    final static XPath xpath = XPathFactory.newInstance().newXPath();
    static String evalString(Node context, String path) throws XPathExpressionException {
        return (String) xpath.evaluate(path, context, XPathConstants.STRING);
    }

    public static void main(String[] args) throws Exception {
        File file = new File("feed.xml");
        Document document = DocumentBuilderFactory.newInstance().newDocumentBuilder().parse(file);
        NodeList entriesNodeList = (NodeList) xpath.evaluate("//entry", document, XPathConstants.NODESET);

        List<Entry> entries = new ArrayList<Entry>();
        for (int i = 0; i < entriesNodeList.getLength(); i++) {
            Node entryNode = entriesNodeList.item(i);
            entries.add(new Entry(
                evalString(entryNode, "title"),
                evalString(entryNode, "id"),
                evalString(entryNode, "tempi/conento/@madeIn"),
                evalString(entryNode, "tempi/@type")
            ));
        }
        for (Entry entry : entries) {
            System.out.println(entry);
        }
    }
}

这会产生以下输出：

id1:FEED TITLE 1(MadeIn1)[type1]
id2:FEED TITLE 2(MadeIn2)[type2]
id3:()[]

请注意，使用XPath可以使值检索变得非常简单，直观，可读且简单，并且还可以优雅地处理“缺失”值。

API链接

Answer 2

使用Element.getAttribute和Element.setAttribute

在您的示例中，((Node) content.item(0)).getFirstChild().getAttributes()。假设content是一个拼写错误，你的意思是contento，getFirstChild正确地返回NULL，因为contento没有子节点。请尝试：((Node) contento.item(0)).getAttributes()。

另一个问题是，在不检查返回值的情况下使用getFirstChild和getChildNodes()[0]，您将面临拾取子文本节点的风险，而不是您想要的元素。

Answer 3

正如所指出的，<contento>没有任何孩子，而不是：

(contento.item(0)).getFirstChild().getAttributes()

您应该将Node视为Element并使用getAttribute（String），如下所示：

((Element)contento.item(0)).getAttribute("madeIn")

以下是您的代码的修改版本（它不是我编写的最强大的代码）：

InputStream inputStream = new ByteArrayInputStream(xml.getBytes());
DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
DocumentBuilder db = dbf.newDocumentBuilder();
Document doc = db.parse(inputStream);
doc.getDocumentElement().normalize();
System.out.println("Root element " + doc.getDocumentElement().getNodeName());
NodeList nodeLst = doc.getElementsByTagName("entry");
System.out.println("Information of all entries");

for (int s = 0; s < nodeLst.getLength(); s++) {

    Node fstNode = nodeLst.item(s);

    if (fstNode.getNodeType() == Node.ELEMENT_NODE) {

        Element fstElmnt = (Element) fstNode;

        NodeList title = fstElmnt.getElementsByTagName("title").item(0).getChildNodes();
        System.out.println("Title : " + (title.item(0)).getNodeValue());

        NodeList id = fstElmnt.getElementsByTagName("id").item(0).getChildNodes();
        System.out.println("Id: " + (id.item(0)).getNodeValue());

        Node tempiNode = fstElmnt.getElementsByTagName("tempi").item(0);
        System.out.println("Type : " + ((Element) tempiNode).getAttribute("type"));

        Node contento = tempiNode.getChildNodes().item(0);
        System.out.println("Made in : " + ((Element) contento).getAttribute("madeIn"));
    }
}

在XML片段上运行它会产生以下输出：

Root element entry
Information of all entries
Title : FEED TITLE
Id: 5467sdad98787ad3149878sasda
Type : application/xml
Made in : USA

顺便说一下，您是否考虑使用类似Rome的内容？

XML子节点属性值

3 个答案:

API链接