Parsing HTML from a URL with jsoup on Android

Date: 2014-08-02 20:38:51

Tags: android html parsing jsoup

I have this HTML:

<ul class="programList">

            <li class="showing">
                <div class="filterTime"></div>
                <div class="filterGenre"></div>
                <div class="outerSmallPoster">
                    <span class="posterBanner"></span>
                    <a href="/filmdatabase/06-juni/22-jump-street/">
                        <img src="/fileshare/filarkivroot/AuroraKino/Filmer/06%20-%20juni/22%20Jump%20Street/kynoefo11.jpg?width=160" class="smallPoster" /></a>
                </div>
                <div class="movieDescr">
                    <a class="movieTitle" href="/filmdatabase/06-juni/22-jump-street/">22 Jump Street</a>

                    <p class="movieDescription">Channing Tatum og Jonah Hill er tilbake i rollene som radarparet Jenko og Schmidt i oppfølgeren til...</p>
                </div>
                <div class="outerMovieDescription">



                    <div class="outerProgramTicketSale">


                        <button type="button" data-frames="underholdning" data-hour="21" href="http://91.207.226.164/ticketweb.php?sign=2&UserCenterID=100007&PaymentType=000&ShowID=484797&PaymentTypeSelection=&ErrorCode=0" target="_blank" onclick="openTicket(100007,484797);return false;" data-usercenterid="100007" data-showid="484797" class="programTime">
                            21:15

                        </button>
                        <span class="theater">Sal 6</span>
                    </div>

                </div>
            </li>

</ul>

This HTML is served at a URL that I am trying to parse from an Android app. To do that, I wrote the following code:

    protected Hashtable<String, Elements> doInBackground(Void... params) {
        // Get all the movies from aurorakino and return a list of them
        // to the postexecute method.

        MainActivity.out("Bakgrunn");

        Hashtable<String, Elements> map = new Hashtable<String, Elements>();

        Document doc;
        try {
            doc = (Document) Jsoup.connect("http://fokus.aurorakino.no/billetter-og-program/").get();

            Elements title = doc.select("a.movieTitle");
            Elements desc = doc.select("p.movieDescription");
            Elements image = doc.select("img.smallPoster");

            MainActivity.out(title.size());

            MainActivity.out("Vi er inne i try");

            map.put("title", title);
            map.put("desc", desc);
            map.put("image", image);

            return map;
        } 
        catch (IOException e) {
        // TODO Auto-generated catch block
            MainActivity.out("Noe gikk galt");
        }

        return null;
    }

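This code lives in an AsyncTask, and the task does run. I print some debug output at each stage of the AsyncTask so that I can see whether it actually executes. It does.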

My problem is that even though the HTML page has links like this:

<a href="xxx" class="movieTitle">movie title</a>

the code does not find any of them. I print the size of the resulting Elements, and it is zero.

When I remove the class part of the selector (selecting just "a" instead of "a.movieTitle"), it finds 73 elements.
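One thing that could help narrow this down is checking whether the HTML the app actually receives contains the class attribute at all, since the markup a browser shows and the markup returned to the app are not guaranteed to be identical. The following is only a sketch using plain Java string matching. `ClassAttributeCheck` and `countClass` are hypothetical names that are not part of jsoup, and the sketch assumes the fetched page is available as a string, for example from `doc.outerHtml()`.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not part of jsoup): counts tags whose class
// attribute lists a given class name, using only the raw HTML text.
public class ClassAttributeCheck {

    // Matches class="..." or class='...'; the lookbehind skips
    // attributes such as data-class="...".
    private static final Pattern CLASS_ATTR = Pattern.compile(
            "(?<![\\w-])class\\s*=\\s*[\"']([^\"']*)[\"']",
            Pattern.CASE_INSENSITIVE);

    public static int countClass(String html, String className) {
        int count = 0;
        Matcher m = CLASS_ATTR.matcher(html);
        while (m.find()) {
            // A class attribute may hold several space-separated names.
            for (String token : m.group(1).trim().split("\\s+")) {
                if (token.equals(className)) {
                    count++;
                    break;
                }
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // Sample taken from the HTML shown earlier in this question.
        String html =
                "<a class=\"movieTitle\" href=\"/filmdatabase/06-juni/22-jump-street/\">22 Jump Street</a>"
                + "<p class=\"movieDescription\">...</p>";
        System.out.println(countClass(html, "movieTitle"));
    }
}
```

If a call such as `ClassAttributeCheck.countClass(doc.outerHtml(), "movieTitle")` returned zero inside the AsyncTask, that would suggest the fetched document differs from the HTML above, rather than the `a.movieTitle` selector being wrong. A non-zero count would point back at the selector or the parsing step.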

What am I doing wrong?

Many thanks.

0 answers:

No answers yet.