Question

我有以下代码。我正在使用jsoup库从网站检索URL;之后，我正在检查URL是否包含我想要的关键字，并将它们列在另一个字符串中。我的问题是我无法只检索一个URL。看看我的代码：

// Get the webpage and parse it.
org.jsoup.nodes.Document doc = Jsoup.connect("http://www.examplepage").get();

// Get the anchors with href attribute.
// Or, you can use doc.select("a") to get all the anchors.
org.jsoup.select.Elements links = doc.select("a[href]");

// Iterate over all the links and process them.
for (org.jsoup.nodes.Element link : links) {
     String scrapedlinks += link.attr("abs:href")+"\n" ;
     String scrapedlinks3 ="";                  
}

 String[] links2 = links.split("\n");
 for (String newlink  : hulklinks  ) {
        if (newlink("mysearchterm")) {
            scrapedlinks3 +=newlink ;
            String[] scrapedlines = scrapedlinks3.split("\n" );
        }
 }

Answer 1

我认为如果您将网址直接存储在Arraylist中会更容易：

Arraylist<String> urls = new Arraylist<String>();
for (org.jsoup.nodes.Element link : links)
    urls.add(link.attr("abs:href"));

在此之后，您可以使用

轻松访问它们

urls.get(i);

多行字符串为每行分隔新字符串

1 个答案: