如何使用jsoup发布表单登录?

时间:2014-04-27 07:15:53

标签: java jsoup

我要登录here enter image description here 源代码

<HTML><HEAD><TITLE>:: Dhaka Electric Supply Company Limited (DESCO)::</TITLE>
<META http-equiv=Content-Type content="text/html; charset=utf-8">
<META http-equiv=Pragma content=no-cache>
<META http-equiv=imagetoolbar content=no>
<META content="Home Page of  Dhaka Electric Supply Company Limited" name=description>
<META content="MSHTML 6.00.2900.2180" name=GENERATOR>
<style type="text/css">
    img{
        border:0px;
    }   
</style>
<script type="text/javascript" src="js/ajax-dynamic-content.js"></script>
<script type="text/javascript" src="js/ajax.js"></script>
<script type="text/javascript" src="js/ajax-tooltip.js"></script>   
<link rel="stylesheet" href="css/ajax-tooltip.css" media="screen" type="text/css">
<link rel="stylesheet" href="css/ajax-tooltip-demo.css" media="screen" type="text/css">
<LINK media=all href="css/global.css" type=text/css rel=stylesheet>


</HEAD> 
<BODY>
<DIV id=over style="DISPLAY: none"></DIV>
<DIV class=baselayout>
<DIV class=pagetopshadow></DIV>
<DIV class=basebg>
<DIV class=pageleft></DIV>
<DIV class=pagecenter>
<DIV id=Header>
</DIV>
<DIV id=Menu>
</DIV>
<DIV id=HomeContent>
<DIV class=right>
<DIV class=homeintro>

<div style="padding-top: 0px;">
<script> 
function checkLogin()
 { if( document.login.username.value == '') 
  { 
  alert( 'Please enter  your account number' );
   return false; 
   }return true;
   }
   alert('Payments through VISA and Master Card are stopped by DBBL. only DBBL Nexus card is allowed.');
    </script>

        <form NAME='login' METHOD='POST' ACTION='authentication.php'>

          <table width="350" height="181"cellpadding='0' cellspacing='0' border='0' style="border:#e5e5e5 0px solid; BACKGROUND: url(css/images/top9.png) no-repeat left top;" align="center">
          <tr> <td rowspan="6" style="padding-left:15px;"><img src="css/images/groupperms.gif" width="50" height="50"><td></tr>
            <tr>
              <td colspan="2" height="50"></td>
            </tr>
           <tr>
              <td nowrap><span class="textcolor1">Account No. </span></td>
              <td><input type='text' name='username' style="border:#cacaca 2px solid; color:#000099;" value=''></td></tr>

            <tr>
            <td> <a class="uiTooltip" href="#" onMouseOver="ajax_showTooltip(window.event,'tooltip/help.html',this);return false" onMouseOut="ajax_hideTooltip()">Help</a></td>
            <td><input name="login" type='submit' value='Login' style="width:80px; font-family:Arial, Helvetica, sans-serif; font-weight:bold; color:#FFFFFF; border:#000000 2px solid; cursor:pointer; background-color: #1b4585;"  border="0" align="right" title="Login"  onClick="return checkLogin();"/></td>
            </tr>

        </table>

        </form>
<table width="630" border="0" cellspacing="2" cellpadding="2" align="center" bgcolor="#FFFFFF" bordercolor="#FFFFFF">
  <tr>
    <td width="80"></td>
    <td><img src="images/right_3.gif"/></td>
    <td><span class="textcolor"><strong><a href='https://www.desco.org.bd/index.php?page=internet-bill-payment' target="_blank" class="uiTooltip">Before, use this facility/services please read instructions...</a></strong></span></td>
  </tr>
  </table>
<p align="center" class="textcolor"><strong><a href='https://www.desco.org.bd' class="uiTooltip">Back Home</a></strong></p>
<table align="center" width="135" border="0" cellpadding="2" cellspacing="0" title="Click to Verify - This site chose VeriSign SSL for secure e-commerce and confidential communications.">
<tr>
<td width="135" align="center" valign="top"><script type="text/javascript" src="https://seal.verisign.com/getseal?host_name=www.desco.org.bd&amp;size=S&amp;use_flash=YES&amp;use_transparent=YES&amp;lang=en"></script><br />
<a href="http://www.verisign.com/ssl-certificate/" target="_blank"  style="color:#000000; text-decoration:none; font:bold 7px verdana,sans-serif; letter-spacing:.5px; text-align:center; margin:0px; padding:0px;">ABOUT SSL CERTIFICATES</a></td>
</tr>
</table>


</div>
<div align="center" style="padding-top:10px;">



<CENTER><B>
    Total Visits:  1</B></CENTER>   
</div>
</DIV>
</DIV>
</DIV>
</DIV>
<DIV class=pageright></DIV></DIV>
<DIV class=pagebotshadow>
<DIV  id=Footer>Copyright © 2010 Dhaka Electric Supply Company Ltd. All rights reserved.</DIV>
</DIV>
</DIV>
</BODY>
</HTML>

基本上代码是

形式
<form NAME='login' METHOD='POST' ACTION='authentication.php'>

      <table width="350" height="181"cellpadding='0' cellspacing='0' border='0' style="border:#e5e5e5 0px solid; BACKGROUND: url(css/images/top9.png) no-repeat left top;" align="center">
      <tr> <td rowspan="6" style="padding-left:15px;"><img src="css/images/groupperms.gif" width="50" height="50"><td></tr>
        <tr>
          <td colspan="2" height="50"></td>
        </tr>
       <tr>
          <td nowrap><span class="textcolor1">Account No. </span></td>
          <td><input type='text' name='username' style="border:#cacaca 2px solid; color:#000099;" value=''></td></tr>

        <tr>
        <td> <a class="uiTooltip" href="#" onMouseOver="ajax_showTooltip(window.event,'tooltip/help.html',this);return false" onMouseOut="ajax_hideTooltip()">Help</a></td>
        <td><input name="login" type='submit' value='Login' style="width:80px; font-family:Arial, Helvetica, sans-serif; font-weight:bold; color:#FFFFFF; border:#000000 2px solid; cursor:pointer; background-color: #1b4585;"  border="0" align="right" title="Login"  onClick="return checkLogin();"/></td>
        </tr>

    </table>

    </form>

enter image description here

我正在尝试这个

package jsouptest;

import org.jsoup.Connection;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;

public class JsouptTest {
    public static void main(String[] args) throws Exception {

        Connection.Response loginForm = Jsoup.connect("https://www.desco.org.bd/ebill/login.php")
                .method(Connection.Method.GET)
                .execute();

        Document document = Jsoup.connect("https://www.desco.org.bd/ebill/login.php")
                .data("cookieexists", "false")
                .data("username", "32007702")
                .data("login", "Login")
                .cookies(loginForm.cookies())
                .post();
        System.out.println(document);

    }

}

但是我的错误

Exception in thread "main" java.net.SocketTimeoutException: Read timed out
    at java.net.SocketInputStream.socketRead0(Native Method)
    at java.net.SocketInputStream.read(SocketInputStream.java:152)
    at java.net.SocketInputStream.read(SocketInputStream.java:122)
    at sun.security.ssl.InputRecord.readFully(InputRecord.java:442)
    at sun.security.ssl.InputRecord.read(InputRecord.java:480)
    at sun.security.ssl.SSLSocketImpl.readRecord(SSLSocketImpl.java:927)
    at sun.security.ssl.SSLSocketImpl.performInitialHandshake(SSLSocketImpl.java:1312)
    at sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1339)
    at sun.security.ssl.SSLSocketImpl.startHandshake(SSLSocketImpl.java:1323)
    at sun.net.www.protocol.https.HttpsClient.afterConnect(HttpsClient.java:563)
    at sun.net.www.protocol.https.AbstractDelegateHttpsURLConnection.connect(AbstractDelegateHttpsURLConnection.java:185)
    at sun.net.www.protocol.https.HttpsURLConnectionImpl.connect(HttpsURLConnectionImpl.java:153)
    at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:439)
    at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:424)
    at org.jsoup.helper.HttpConnection.execute(HttpConnection.java:178)
    at jsouptest.JsouptTest.main(JsouptTest.java:12)
我失踪了吗?如何解决?

3 个答案:

答案 0 :(得分:28)

您为执行POST请求而使用的URL是错误的,因为当您必须对表单执行特定请求时,您应该使用表单标记中存在的网页,在这种情况下“ authentication.php”。

所以代码将是:

    package jsouptest;

    import org.jsoup.Connection;
    import org.jsoup.Jsoup;
    import org.jsoup.nodes.Document;

    public class JsouptTest {
        public static void main(String[] args) throws Exception {

           Connection.Response loginForm = Jsoup.connect("https://www.desco.org.bd/ebill/login.php")
            .method(Connection.Method.GET)
            .execute();

           Document document = Jsoup.connect("https://www.desco.org.bd/ebill/authentication.php")
            .data("cookieexists", "false")
            .data("username", "32007702")
            .data("login", "Login")
            .cookies(loginForm.cookies())
            .post();
           System.out.println(document);

       }

    }

这个正确地检索了你想要的网页。

答案 1 :(得分:2)

对我来说问题是我没有发送viewstate,eventvalidation,viewstategenerator值。

要获取这些值,您必须先向登录表单页面发送GET请求。 在我的实例中,这是一个default.aspx页面。

然后你必须提取这些值并将它们放入变量中。当然,您还需要表单登录和密码字段ID,提交按钮的ID等。有关发送的所有变量的列表,请手动登录并使用chrome dev工具(检查元素)查看POST请求的网络选项卡。在里面它应该显示您提交的用户名和密码。在那里你会看到发送的其他变量。

network tab in dev tools example

接下来,发送包含所有这些变量的POST请求,将响应中的cookie保存到变量中,然后您可以使用它转到另一个页面。

代码:

import java.io.IOException;
import java.util.Map;
import org.jsoup.Connection;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.nodes.FormElement;


public class get_example {

public static void main(String[] args) throws IOException {
    //===================================
    Connection.Response response2 = Jsoup.connect("yourloginpage")
            .method(Connection.Method.GET)
            .execute();

    Document responseDocument = response2.parse();

    Element eventValidation = responseDocument.select("input[name=__EVENTVALIDATION]").first();
    Element viewState = responseDocument.select("input[name=__VIEWSTATE]").first();
    Element viewStateGen = responseDocument.select("input[name=__VIEWSTATEGENERATOR]").first();



    Connection.Response response = Jsoup.connect("yourloginpage")
            .method(Connection.Method.POST)
            .data("ctl00$plnMain$txtLogin", "username")
            .data("ctl00$plnMain$txtPassword", "password")
            .data("ctl00$plnMain$Submit1", "Log In")
            .data("ctl00$ucCopyright$hdnPrivacyStatement", "Privacy Statement")
            .data("ctl00$ucCopyright$hdnTermsOfUse", "Terms of Use")
            .data("__VIEWSTATE", viewState.attr("value"))
            .data("__VIEWSTATEGENERATOR", viewStateGen.attr("value"))
            .data("__EVENTVALIDATION", eventValidation.attr("value"))
            .execute();


    Map<String, String> cookies = response.cookies();


    Document homePage = Jsoup.connect("anotherpage")
            .cookies(cookies)
            .get();


    System.out.println(homePage.body().html());

}
}

答案 2 :(得分:0)

就我而言,它包含三个步骤:

  1. 从登录表单页面获取Cookie;
  2. 将凭据发布到登录表单action= URL,续订Cookie;
  3. 用户将被重定向到索引页面。请勿在此处解析,只需将Response发送到最终目标网址即可。
  4. 到达最后一页时解析。
  5. Cookie应该在每个步骤中更新,有时会添加条目,有时您必须清除它并重置。请务必使用Internet Explorer / Chrome / Firefox Dev工具( F12 )在网络面板中cookies部分检查网络传输。