How do I install Boilerpipe on Windows?

Time: 2012-04-09 12:02:42

Tags: java netbeans web-scraping web-mining boilerpipe

Can anyone tell me how to use Boilerpipe in NetBeans on Windows? I would appreciate it if you could give me some Java code.

1 answer:

Answer 0 (score: 0)

Try looking at their Wiki and its QuickStart. Sample code follows:

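import java.io.InputStream;
import java.net.URL;

import org.xml.sax.InputSource;

import de.l3s.boilerpipe.document.TextDocument;
import de.l3s.boilerpipe.extractors.ArticleExtractor;
import de.l3s.boilerpipe.extractors.DefaultExtractor;
import de.l3s.boilerpipe.sax.BoilerpipeSAXInput;

// The class name is arbitrary; the boilerpipe JAR and its dependencies must be on the classpath.
public class BoilerpipeDemo {
    public static void main(final String[] args) throws Exception {
        final URL url = new URL("http://www.example.com/some-location/index.html");

        // NOTE: We ignore HTTP-based character encoding in this demo...
        final TextDocument doc;
        try (InputStream urlStream = url.openStream()) {
            // Parse the HTML into boilerpipe's document model; the stream is closed even on failure
            doc = new BoilerpipeSAXInput(new InputSource(urlStream)).getTextDocument();
        }

        // You have the choice between different extractors
        // System.out.println(DefaultExtractor.INSTANCE.getText(doc));
        System.out.println(ArticleExtractor.INSTANCE.getText(doc));
    }
}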
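
The NOTE in the sample code says the demo ignores the HTTP-based character encoding. As a rough sketch of one way to honor it, the charset could be read from the Content-Type header and set on the InputSource before parsing. This is not taken from the Boilerpipe Wiki: the class name CharsetAwareInput, both helper methods, and the UTF-8 fallback are assumptions made for illustration, and whether the underlying HTML parser actually respects the encoding set on the InputSource is also an assumption. The sketch uses only JDK classes.

```java
import java.io.InputStream;
import java.net.URL;
import java.net.URLConnection;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

import org.xml.sax.InputSource;

// Hypothetical helper class, not part of boilerpipe.
public class CharsetAwareInput {

    /** Returns the charset named in a Content-Type value, or UTF-8 (assumed fallback) if none is usable. */
    public static Charset charsetFromContentType(final String contentType) {
        if (contentType != null) {
            for (final String part : contentType.split(";")) {
                final String p = part.trim();
                // Case-insensitive match on the "charset=" parameter
                if (p.regionMatches(true, 0, "charset=", 0, 8)) {
                    final String name = p.substring(8).trim().replace("\"", "");
                    try {
                        return Charset.forName(name);
                    } catch (IllegalArgumentException e) {
                        break; // malformed or unsupported charset name: use the fallback
                    }
                }
            }
        }
        return StandardCharsets.UTF_8;
    }

    /** Wraps the stream in an InputSource that carries the declared encoding. */
    public static InputSource toInputSource(final InputStream stream, final String contentType) {
        final InputSource is = new InputSource(stream);
        is.setEncoding(charsetFromContentType(contentType).name());
        return is;
    }

    public static void main(final String[] args) throws Exception {
        final URLConnection conn =
                new URL("http://www.example.com/some-location/index.html").openConnection();
        try (InputStream stream = conn.getInputStream()) {
            final InputSource is = toInputSource(stream, conn.getContentType());
            System.out.println("Declared encoding: " + is.getEncoding());
            // 'is' could be handed to new BoilerpipeSAXInput(is) here, as in the answer's code
        }
    }
}
```

The parsing logic is kept in a separate method so it can be checked without network access; for example, charsetFromContentType("text/html; charset=ISO-8859-1") yields the ISO-8859-1 charset, while a missing or unrecognized charset falls back to UTF-8.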