I want to scrape the URL "http://www.gc-zb.com/index/index.html", but when I run the code below I get an error:
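    import java.io.BufferedReader;
    import java.io.IOException;
    import java.io.InputStream;
    import java.io.InputStreamReader;
    import java.net.HttpURLConnection;
    import java.net.URL;

    public class InvitedBids {
        public static void main(String[] args) throws IOException {
            InputStream inputStream = null;
            HttpURLConnection httpConn = null;
            InputStreamReader inputStreamReader = null;
            BufferedReader bufferedReader = null;
            StringBuilder contentBuf = null;
            String myURL = "http://www.gc-zb.com/index/index.html";
            URL url = null;
            try {
                url = new URL(myURL);
                System.out.println(url);
                httpConn = (HttpURLConnection) url.openConnection();
                httpConn.setRequestMethod("GET");
                inputStream = httpConn.getInputStream(); // error occurs here
                inputStreamReader = new InputStreamReader(inputStream, "utf-8");
                bufferedReader = new BufferedReader(inputStreamReader);
                String line = "";
                contentBuf = new StringBuilder();
                // Read the response body line by line
                while ((line = bufferedReader.readLine()) != null) {
                    contentBuf.append(line);
                }
                String buf = contentBuf.toString();
                System.out.println(buf);
            } catch (Exception e) {
                e.printStackTrace();
            } finally {
                // Close I/O and HTTP
                if (bufferedReader != null) {
                    bufferedReader.close(); // also closes the wrapped reader and stream
                } else if (inputStream != null) {
                    inputStream.close();
                }
                if (httpConn != null) {
                    httpConn.disconnect();
                }
            }
        }
    }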
The console output is:
http://www.gc-zb.com/index/index.html
java.io.IOException: Server returned HTTP response code: 521 for URL: http://www.gc-zb.com/index/index.html
at sun.net.www.protocol.http.HttpURLConnection.getInputStream0(Unknown Source)
at sun.net.www.protocol.http.HttpURLConnection.getInputStream(Unknown Source)
at com.feilong.bid.InvitedBids.main(InvitedBids.java:43)
Does anyone know how to solve this? Thank you!
Answer 0 (score: 0)
You could use Jsoup, for example:
Document doc = Jsoup.connect("http://example.com/").get();
String title = doc.title();
Answer 1 (score: 0)
I recommend using org.apache.commons.httpclient.HttpClient to solve this problem.
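
Whichever client is used, it may help to first look at what the server actually sends back with the 521 status. 521 is not one of the standard HTTP status codes, so its meaning is up to the server. HttpURLConnection.getInputStream() throws an IOException for any status of 400 or above, but the response body is still available through getErrorStream(). The following is only a minimal sketch using the JDK: the class name ResponseInspector, the fetch helper, the User-Agent value, and the timeouts are illustrative assumptions, and nothing here is guaranteed to make the site return a normal page.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of the original question.
public class ResponseInspector {

    /** Returns the status code, a newline, and the response body, even for error statuses. */
    public static String fetch(String address) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) new URL(address).openConnection();
        conn.setRequestMethod("GET");
        // Assumption: the server may treat the default Java User-Agent differently.
        // This value is only an example.
        conn.setRequestProperty("User-Agent", "Mozilla/5.0");
        conn.setConnectTimeout(10_000);
        conn.setReadTimeout(10_000);
        try {
            int status = conn.getResponseCode();
            // For status >= 400 getInputStream() throws, so read getErrorStream() instead.
            InputStream body = status >= 400 ? conn.getErrorStream() : conn.getInputStream();
            return status + "\n" + readAll(body);
        } finally {
            conn.disconnect();
        }
    }

    private static String readAll(InputStream in) throws IOException {
        if (in == null) {
            return ""; // getErrorStream() returns null when the server sent no body
        }
        StringBuilder sb = new StringBuilder();
        try (BufferedReader reader =
                 new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                sb.append(line).append('\n');
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        System.out.println(fetch("http://www.gc-zb.com/index/index.html"));
    }
}
```

If the printed body turns out to contain a script or a cookie instruction rather than the page itself, a plain GET from any HTTP client is likely to hit the same response, and the request would need whatever headers or cookies the server expects.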