Parsing company information

Time: 2013-02-09 15:32:19

Tags: java parsing web-scraping html-parsing finance

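I was wondering if anyone knows how to successfully parse the company name "Alcoa Inc." shown at the URL below. It would be much easier to show a picture, but I don't have enough reputation. Any help would be appreciated.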

http://www.google.com/finance?q=NYSE%3AAA&ei=LdwVUYC7Fp_YlgPBiAE

Here is what I have tried so far, using jsoup to parse this div class:

<div class="appbar-snippet-primary">
  <span>Alcoa Inc.</span>
</div>



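// Needs: java.io.IOException, org.jsoup.Jsoup, org.jsoup.nodes.Document, org.jsoup.select.Elements
// Fetches the page at url and selects elementType.element (e.g. div.appbar-snippet-primary),
// optionally narrowed to a descendant returnElement (e.g. span).
public Elements htmlParser(String url, String element, String elementType, String returnElement) {
    try {
        // Jsoup.connect(url).get() already returns a parsed Document,
        // so re-parsing doc.html() is unnecessary.
        Document doc = Jsoup.connect(url).get();

        if (returnElement == null) {
            return doc.select(elementType + "." + element);
        } else {
            return doc.select(elementType + "." + element + " " + returnElement);
        }
    } catch (IOException e) {
        // The original catch block was not posted; returning an empty
        // selection here is an assumption so that callers can still call text().
        e.printStackTrace();
        return new Elements();
    }
}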

public String htmlparseGoogleStocks(String url) {
    // CSS class names used to build the selectors
    String pr = "pr";
    String appbar_center = "appbar-snippet-primary";
    String val = "val";

    // Tag names used to build the selectors
    String span = "span";
    String div = "div";
    String td = "td";

    Elements price_data;
    Elements title_data;
    Elements more_data;

    // Selects span.pr (each htmlParser call fetches the page again)
    price_data = htmlParser(url, pr, span, null);

    // Selects div.appbar-snippet-primary span
    title_data = htmlParser(url, appbar_center, div, span);

    //more_data = htmlParser(url, val, td, null);

    //String stockprice = price_data.text();
    String title = title_data.text(); // text() already returns a String

    //System.out.println(more_data.text());

    return title;
}

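To make it easier to see what the two htmlParser calls above actually ask jsoup for, here is a minimal sketch that only rebuilds the selector strings, with no network access and no jsoup dependency. The class name SelectorSketch and the method buildSelector are hypothetical names introduced for illustration; the concatenation mirrors the one in htmlParser.

```java
public class SelectorSketch {

    // Builds the CSS selector the same way htmlParser does:
    // "<tag>.<class>" and, if returnElement is given, " <descendant tag>" appended.
    public static String buildSelector(String elementType, String element, String returnElement) {
        String base = elementType + "." + element;
        return (returnElement == null) ? base : base + " " + returnElement;
    }

    public static void main(String[] args) {
        // Price lookup from htmlparseGoogleStocks
        System.out.println(buildSelector("span", "pr", null));
        // Title lookup from htmlparseGoogleStocks
        System.out.println(buildSelector("div", "appbar-snippet-primary", "span"));
    }
}
```

This prints span.pr and div.appbar-snippet-primary span. If the second selector returns nothing for the live page, comparing it against the HTML that Jsoup.connect(url).get() actually receives is a reasonable first check.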
1 answer:

Answer 0 (score: 1)

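Myself, I'd analyze the source HTML of the page of interest and then use JSoup to extract the information. For example, using a very small JSoup program such as: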

import java.io.IOException;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.select.Elements;

public class GoogleFinance {

   public static final String PAGE = "https://www.google.com/finance?q=NASDAQ:XONE";  

   public static void main(String[] args) throws IOException {
      Document doc = Jsoup.connect(PAGE).get();

      Elements title = doc.select("title");

      System.out.println(title.text());
   }
}

You'll get in return:

ExOne Co: NASDAQ:XONE quotes & news - Google Finance

It doesn't get much easier than that.
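Since the question asks for just the company name, one possible follow-up is to cut the title text at its first colon. This is only a sketch: it assumes the title has the form "<company>: <EXCHANGE>:<TICKER> quotes & news - Google Finance", and the class name TitleSketch and the method companyName are hypothetical names introduced here, not part of jsoup.

```java
public class TitleSketch {

    // Returns the text before the first ':' in the page title, trimmed.
    // Assumption: the company name comes first and contains no colon itself.
    // If there is no colon at all, the whole trimmed title is returned.
    public static String companyName(String title) {
        int colon = title.indexOf(':');
        return (colon < 0) ? title.trim() : title.substring(0, colon).trim();
    }

    public static void main(String[] args) {
        // Sample title in the assumed format; in the program above
        // this string would come from title.text().
        String title = "ExOne Co: NASDAQ:XONE quotes & news - Google Finance";
        System.out.println(companyName(title)); // prints "ExOne Co"
    }
}
```

This depends on the page's title format staying the same, so it is more fragile than selecting a dedicated element would be.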