Question

Please note that I am using Java in Eclipse with the jsoup library. My code is:

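Document doc = null;
String crawUrl = this.getCrawlUrl();
// Fetch the page and parse it into a jsoup Document
doc = Jsoup.connect(crawUrl).get();
// Select the root <html> element and print it
Elements hrefs2 = doc.select("html");
System.out.println(hrefs2);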

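I am trying to get the entire HTML code of a specific page, but when the page contains elements such as divs, I don't get them. How can I get the entire HTML code of a specific page?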

Answer 1

You can try this:

Document doc = Jsoup.connect(crawUrl).get();

System.out.println(doc.toString());
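
If some div elements are still missing from the printed document, it may help to check whether they are in the server's response at all. jsoup does not execute JavaScript, so elements that scripts create in the browser never appear in doc. Below is a minimal sketch that fetches the raw HTML without jsoup for comparison, assuming Java 11 or later (java.net.http.HttpClient). The class name RawHtmlFetcher, its method names, and the User-Agent value are hypothetical and not part of the original code.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;

class RawHtmlFetcher {

    // Builds a plain GET request; the User-Agent value is an arbitrary placeholder.
    static HttpRequest buildRequest(String url) {
        return HttpRequest.newBuilder(URI.create(url))
                .header("User-Agent", "Mozilla/5.0 (raw-html-check)")
                .timeout(Duration.ofSeconds(10))
                .GET()
                .build();
    }

    // Returns the response body as text, without HTML parsing or JavaScript execution.
    static String fetch(String url) throws IOException, InterruptedException {
        HttpClient client = HttpClient.newBuilder()
                .followRedirects(HttpClient.Redirect.NORMAL)
                .build();
        HttpResponse<String> response =
                client.send(buildRequest(url), HttpResponse.BodyHandlers.ofString());
        return response.body();
    }

    public static void main(String[] args) throws Exception {
        // Pass the same URL that getCrawlUrl() returns.
        System.out.println(fetch(args[0]));
    }
}
```

If the divs are absent from this raw output as well, the page most likely builds them with JavaScript, and neither doc.select("html") nor doc.toString() will show them.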

Web crawler to find the entire HTML code
