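Java: HTTPS login and browsing

Time: 2010-11-03 13:43:00

Tags: java sockets login https safe-browsing

I want to use Java to log in to an HTTPS website and then read some information from it. I have already looked at the headers with Firebug, but I could not get it to work...

Firebug shows: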

https://service.example.net/xxx/unternehmer/login.html?login=Anmelden&loginname=xxx&password=xxx&sessionid=&sprache=de

Then I want to browse this page:

https://service.example.net/xxx/unternehmer/ausgabe.html?code=PORTAL;sessionid=03112010150442

How can I do this in Java? I have already tried something like this:

import java.net.*;
import java.io.*;
import java.security.*;
import javax.net.ssl.*;

public class HTTPSClient {

  public static void main(String[] args) {
    int port = 443; // default https port
    String host = "service.example.net";
    try {
      SSLSocketFactory factory = (SSLSocketFactory) SSLSocketFactory.getDefault();

      SSLSocket socket = (SSLSocket) factory.createSocket(host, port);

      // enable all the suites
      String[] supported = socket.getSupportedCipherSuites();
      socket.setEnabledCipherSuites(supported);


      Writer out = new OutputStreamWriter(socket.getOutputStream());
      // https requires the full URL in the GET line
      out.write("POST https://" + host + "//xxx/unternehmer/login.html?login=Anmelden&loginname=xxx&password=xxx&sessionid=&sprache=de HTTP/1.1\r\n");
      out.write("Host: " + host + "\r\n");
      out.write("\r\n");
      out.flush();

      // read response
      BufferedReader in = new BufferedReader(
        new InputStreamReader(socket.getInputStream()));

      // read the header
      String s;
      while (!(s = in.readLine()).equals("")) {
          System.out.println(s);
      }
      System.out.println();

      // read the length
      String contentLength = in.readLine();
      int length = Integer.MAX_VALUE;
      try {
        length = Integer.parseInt(contentLength.trim(), 16);
      }
      catch (NumberFormatException ex) {
        // This server doesn't send the content-length
        // in the first line of the response body
      }
      System.out.println(contentLength);

      int c;
      int i = 0;
      while ((c = in.read()) != -1 && i++ < length) {
        System.out.write(c);
      }

      System.out.println("1.part done");

      out.close();
      in.close();
      socket.close();

    }
    catch (IOException ex) {
      System.err.println(ex);
    }

  }

}

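Unfortunately, this does not work for the login. I also do not know where to get this sessionid; it is different every time. I hope you can help me. PS: I replaced some sensitive information with xxx.

2 answers:

Answer 0 (score: 4)

Problem solved :)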

First I added these libraries from Apache:

  1. httpclient
  2. commons-httpclient
  3. commons-codec
  4. commons-logging

Then I combined several tutorials.

My code:

    import java.io.BufferedWriter;
    import java.io.FileWriter;
    import java.io.IOException;
    import org.apache.commons.httpclient.Header;
    import org.apache.commons.httpclient.HttpClient;
    // CookiePolicy from the same commons-httpclient 3.x library as HttpClient
    import org.apache.commons.httpclient.cookie.CookiePolicy;
    import org.apache.commons.httpclient.methods.GetMethod;
    import org.apache.commons.httpclient.methods.PostMethod;

    public class Test {

        private static final String USER_AGENT =
                "Mozilla/5.0 (Windows; U; Windows NT 4.1; de; rv:1.9.1.5) Gecko/20091102 Firefox/3.0";

        public static void main(String[] args) throws Exception {

            // One HttpClient instance keeps the cookies between both requests
            HttpClient httpClient = new HttpClient();
            httpClient.getParams().setCookiePolicy(CookiePolicy.BROWSER_COMPATIBILITY);

            // Step 1: log in by POSTing the form fields
            PostMethod post = new PostMethod("https://www.example.com/login.html");
            post.setRequestHeader(new Header("User-Agent", USER_AGENT));
            post.addParameter("login", "true");
            post.addParameter("username", "xxx");
            post.addParameter("password", "xxx");
            post.addParameter("language", "de");

            String body;
            try {
                httpClient.executeMethod(post);
                body = post.getResponseBodyAsString();
            } finally {
                post.releaseConnection();
            }
            System.out.println(body);

            // Get the session id by parsing the response body; I know this is not pretty.
            // It takes the 14 characters that start 10 characters after the first "session".
            int start = body.indexOf("session") + 10;
            String sessionid = body.substring(start, start + 14);
            System.out.println(sessionid);

            // Step 2: request the page to browse, passing the session id
            GetMethod get = new GetMethod("https://www.example.com/thesiteyouwannabrowse?sessionid=" + sessionid);
            get.setRequestHeader(new Header("User-Agent", USER_AGENT));

            String page;
            try {
                httpClient.executeMethod(get);
                page = get.getResponseBodyAsString();
            } finally {
                get.releaseConnection();
            }
            System.out.println(page);

            // Write the page into a file
            try (BufferedWriter out = new BufferedWriter(new FileWriter("file.html"))) {
                out.write(page);
            } catch (IOException e) {
                System.err.println("Error: " + e.getMessage());
            }
        }
    }
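
The fixed character offsets used above to cut out the session id break as soon as the page layout changes. One possible alternative is a regular expression. The sketch below is only an illustration: it assumes the id shows up in the login response as "sessionid=" followed by digits, as in the URL from the question, and the class name SessionIdExtractor and the sample HTML are made up.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class SessionIdExtractor {

    // Assumption: the id is written as "sessionid=<digits>" somewhere in the page
    private static final Pattern SESSION_ID = Pattern.compile("sessionid=(\\d+)");

    // Returns the first session id found in the text, or empty if there is none
    static Optional<String> extract(String html) {
        Matcher m = SESSION_ID.matcher(html);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        // Hypothetical fragment of a login response
        String sample = "<a href=\"ausgabe.html?code=PORTAL;sessionid=03112010150442\">Portal</a>";
        System.out.println(extract(sample).orElse("no session id found"));
    }
}
```

Because the pattern requires at least one digit, an empty "sessionid=" (as in the login URL) is skipped rather than returned as an empty id.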
    

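Answer 1 (score: 3)

I have done something similar myself. I got it working with this "manual" approach, but it was very cumbersome, especially the cookie management.

I suggest you take a look at the Apache HttpClient library. (I threw my own code away when I realized how easy this library is to use.)

As org.life.java pointed out, http://hc.apache.org/httpclient-3.x/sslguide.html is a good guide to getting started with SSL using this library.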
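
To give an idea of the bookkeeping involved, here is a rough sketch of one small piece of manual cookie handling: turning the Set-Cookie values of a response into the Cookie header for the next request. It is not the code I actually used. The class name ManualCookies and the cookie names are invented for the example, and it deliberately ignores path, domain and expiry matching, which is where most of the real effort goes.

```java
import java.net.HttpCookie;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class ManualCookies {

    // Builds the value of a Cookie request header from raw Set-Cookie header values.
    // Attributes such as Path or Secure are parsed but not sent back.
    static String cookieHeader(List<String> setCookieValues) {
        List<String> pairs = new ArrayList<>();
        for (String raw : setCookieValues) {
            for (HttpCookie cookie : HttpCookie.parse(raw)) {
                pairs.add(cookie.getName() + "=" + cookie.getValue());
            }
        }
        return String.join("; ", pairs);
    }

    public static void main(String[] args) {
        // Hypothetical Set-Cookie values as a server might send them after login
        List<String> fromServer = Arrays.asList(
                "JSESSIONID=abc123; Path=/; Secure",
                "lang=de; Path=/");
        System.out.println("Cookie: " + cookieHeader(fromServer));
    }
}
```

With raw sockets, every response has to be scanned for such headers and every later request has to carry the resulting line by hand.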