I am using Xsoup.
Here is the code:
private void updateSeed(Document document) {
    mappingParser.setSeed(
        Xsoup.compile("//div[@class='pgCell'][last()]/a/@href")
            .evaluate(document).get());
}
When I run the method above, I get the following exception:
Exception in thread "main" org.jsoup.select.Selector$SelectorParseException: Could not parse query 'div[@class='pgCell'][last()]': unexpected token at 'last()'
at us.codecraft.xsoup.xevaluator.XPathParser.byFunction(XPathParser.java:225)
at us.codecraft.xsoup.xevaluator.XPathParser.consumePredicates(XPathParser.java:202)
at us.codecraft.xsoup.xevaluator.XPathParser.findElements(XPathParser.java:138)
at us.codecraft.xsoup.xevaluator.XPathParser.parse(XPathParser.java:51)
at us.codecraft.xsoup.xevaluator.XPathParser.parse(XPathParser.java:375)
at us.codecraft.xsoup.xevaluator.XPathParser.combinator(XPathParser.java:85)
at us.codecraft.xsoup.xevaluator.XPathParser.parse(XPathParser.java:49)
at us.codecraft.xsoup.xevaluator.XPathParser.parse(XPathParser.java:375)
at us.codecraft.xsoup.Xsoup.compile(Xsoup.java:27)
at com.qannoufit.test.CrawlerController.updateSeed(CrawlerController.java:102)
at com.qannoufit.test.CrawlerController.populateShouldParse(CrawlerController.java:91)
at com.qannoufit.test.CrawlerController.startCrawling(CrawlerController.java:60)
at com.qannoufit.test.Main.main(Main.java:12)
Answer 0 (score: 0)
Try using a CSS selector instead. The original XPath query translates roughly to:
div.pgCell:last-of-type > a
Once you have the anchor, read its href.
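private void updateSeed(Document document) {
    // Select the link inside the last pgCell div.
    Element anchor = document.select("div.pgCell:last-of-type > a").first();
    if (anchor == null) {
        // Anchor not found, handle error here...
        return; // avoid a NullPointerException on the call below
    }
    // absUrl resolves the href against the document's base URI.
    mappingParser.setSeed(anchor.absUrl("href"));
}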
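To check what the original `[last()]` predicate is meant to select, the same XPath can be run through the JDK's built-in `javax.xml.xpath` engine, which is not Xsoup and does support `last()`. This is only a sketch: the class name, the helper `lastPagerHref`, and the sample markup are hypothetical, and the JDK engine needs well-formed XML, so a real HTML page would have to be cleaned up or converted first.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

class LastPredicateDemo {

    // Hypothetical, well-formed pager markup (an assumption, not the asker's page).
    // Note that the last div sibling does NOT carry the pgCell class.
    static final String SAMPLE =
        "<div id='pager'>"
      + "<div class='pgCell'><a href='/page/1'>1</a></div>"
      + "<div class='pgCell'><a href='/page/2'>2</a></div>"
      + "<div class='other'><a href='/about'>About</a></div>"
      + "</div>";

    // Evaluates the XPath from the question with the JDK engine and
    // returns the string value of the first matching href attribute.
    static String lastPagerHref(String xml) {
        XPath xpath = XPathFactory.newInstance().newXPath();
        try {
            return xpath.evaluate(
                "//div[@class='pgCell'][last()]/a/@href",
                new InputSource(new StringReader(xml)));
        } catch (XPathExpressionException e) {
            // Wrap the checked exception so callers stay simple.
            throw new IllegalStateException("XPath evaluation failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(lastPagerHref(SAMPLE)); // prints "/page/2"
    }
}
```

In this sample the XPath picks the last `div` among those with class `pgCell`, while a CSS selector such as `div.pgCell:last-of-type` looks at the last `div` sibling regardless of class. Here that sibling has class `other`, so the CSS version would match nothing. The two are equivalent only when the last `div` in the container is itself a `pgCell`.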