Question

我正在使用Jsoup来解析一些Web检索页面。出于某些原因，Jsoup在尝试解析http://higherperspectives.com/keanu-reeves-life/

时的内容时会陷入无限循环

以下是使用的代码：

Document document = Jsoup.connect(searchResult.get("link"))
            .timeout(500).followRedirects(false).validateTLSCertificates(false)
            .userAgent("Mozilla/5.0 (Windows NT 6.2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/30.0.1599.69 Safari/537.36")
            .maxBodySize(0).get();

Elements classElements = document.select("*:containsOwn(${owlSearchTerm.get("class")})");
Elements relationElements = document.select("*:containsOwn(${owlSearchTerm.get("relation")})");
Elements individualElements = document.select("*:containsOwn(${owlSearchTerm.get("individual")})");

connect方法永远不会结束。连接成功建立但我猜Jsoup创建Document对象时会出现问题。

有没有办法跳过这样的情况？

Answer 1

我真的不明白你为什么使用maxBodySize（0）选项。我已经尝试过没有它，它运行正常并返回文档。

Jsoup解析卡在无限循环中

1 个答案: