Question

在我的应用程序中，我需要下载Instagram个人资料的HTML源代码并解析它以获取一些信息（媒体和后续计数）。这是我的代码（适用于我测试过的所有网站，除了Instagram）：

try {
            InputStream in;
            URL url = new URL(urlString);

            URLConnection conn = url.openConnection();
            if(!(conn instanceof HttpURLConnection))
                throw new NoConnectionException("not instanceof http");

            HttpURLConnection httpConn = (HttpURLConnection) conn;
            httpConn.setAllowUserInteraction(false);
            httpConn.setInstanceFollowRedirects(true);
            httpConn.setRequestMethod("GET");

            in = httpConn.getInputStream();

            BufferedReader br = new BufferedReader(new InputStreamReader(in));
            String line;
            String source = "";
            while((line = br.readLine()) != null)
                source += line;
            br.close();
} catch(Exception e) {}

当我使用LogCat调试它时，String source为空。

Answer 1

使用Jsoup进行HTML解析。它非常简单方便。从这个答案开始，并按照文件link

如何下载Instagram页面的HTML源代码

1 个答案: