Question

假设这是我用来解析java中的jsoup的HTML

<div id = "report-content">
    <h2>Name <div>or a nick name</div></h2>
    Doe
    <h2>Occupation</h2>
    Singer
    <h2>Lives in</h2>
    Honkong
 </div>

如何获取

等数据

Name
Doe

Occupation
Singer

Lives in 
Honkong

在jsoup？

Answer 1

查看以下代码段：

Document document = Jsoup.parse("<div id = \"report-content\">\r\n" + "    <h2>Name <div>or a nick name</div></h2>\r\n"
        + "    Doe\r\n" + "    <h2>Occupation</h2>\r\n" + "    Singer\r\n" + "    <h2>Lives in</h2>\r\n" + "    Honkong\r\n"
        + " </div>");

Element rootElement = document.getElementById("report-content");
for (Node node : rootElement.childNodes()) {
    if (node instanceof TextNode) {
        // print text after h2 tag
        System.out.println(((TextNode) node).text().trim());
    } else if (node instanceof Element) {
        // print text of the h2 tag without inner elements (div)
        Element element = (Element) node;
        if ("h2".equals(element.tagName())) {
            System.out.println(element.ownText());
        }
    }
}

jsoup：在一些标签之后解析数据（h2）

1 个答案: