Question

我写了一个解析facebook的程序，我已经可以得到整个DOM树了。事情进展顺利，但当我想选择所有<p> - 标签时，问题是它返回一个零大小的数组。 PS：当我解析其他网站而不是facebook时，没有什么问题。

这是我的代码：

public static void main(String[] args) throws IOException {
    doc = connect(); //connect the website,
    System.out.print(doc.outerHtml());//in the wole html file, i can find the tag <p>
    newsHeadlines = doc.select("p"); //nothing
    doc.getElementsByTag("p");//nothing either
    oldEleStr = newsHeadlines.text();
    System.out.println(oldEleStr);//nothing
}


static Document connect() throws IOException {
    org.jsoup.Connection connection = Jsoup
            .connect("facebook.com")
            .cookies(
                    splitCookies(facebookCookies));
    Document doc = connection.get();
    return doc;
}

Answer 1

您可能想要尝试以下内容：

Document new_doc = Jsoup.parse(doc.outerHtml());
Elements elements = doc.select("p");
for (Element aa : elements) {
    //TODO:
}

使用Jsoup解析facebook的问题

1 个答案: