Java URLConnection: getting the final redirected URL

Date: 2013-02-19 07:06:52

Tags: java

I have a URL that redirects to another URL. I want to be able to get the final redirected URL. My code:

    import java.io.IOException;
    import java.io.InputStream;
    import java.net.HttpURLConnection;
    import java.net.MalformedURLException;
    import java.net.URL;

    public class testURLConnection {
        public static void main(String[] args) throws MalformedURLException, IOException {
            HttpURLConnection con = (HttpURLConnection) new URL("http://tinyurl.com/KindleWireless").openConnection();

            System.out.println("original url: " + con.getURL());
            con.connect();

            System.out.println("connected url: " + con.getURL());
            InputStream is = con.getInputStream();
            System.out.println("redirected url: " + con.getURL());
            is.close();
        }
    }

It always prints the original URL, whereas the redirect URL is: http://www.amazon.com/Kindle-Wireless-Reading-Display-Globally/dp/B003FSUDM4/ref=amb_link_353259562_2?pf_rd_m=ATVPDKIKX0DER&pf_rd_s=center-10&pf_rd_r=11EYKTN682A79T370AM3&pf_rd_t=201&pf_rd_p=1270985982&pf_rd_i=B002Y27P3M

How can I get this final redirected URL?

Here is what I tried, looping until there is no further redirect. It still does not give me the desired URL:

    public static String fetchRedirectURL(String url) throws IOException {
        HttpURLConnection con = (HttpURLConnection) new URL(url).openConnection();
        //System.out.println("original url: " + con.getURL());
        con.setInstanceFollowRedirects(false);
        con.connect();

        InputStream is = con.getInputStream();
        if (con.getResponseCode() == 301)
            return con.getHeaderField("Location");
        else
            return null;
    }

    public static void main(String[] args) throws MalformedURLException, IOException {
        String url = "http://tinyurl.com/KindleWireless";
        String fetchedUrl = fetchRedirectURL(url);
        System.out.println("FetchedURL is:" + fetchedUrl);
        while (fetchedUrl != null) {
            url = fetchedUrl;
            System.out.println("The url is:" + url);
            fetchedUrl = fetchRedirectURL(url);
        }
        System.out.println(url);
    }

7 answers:

Answer 0 (score: 14)

Try this. It calls itself recursively, so it also handles URLs that redirect several times.

    public static String getFinalURL(String url) throws IOException {
        HttpURLConnection con = (HttpURLConnection) new URL(url).openConnection();
        // Do not follow redirects automatically; inspect each response ourselves.
        con.setInstanceFollowRedirects(false);
        con.connect();

        int responseCode = con.getResponseCode();
        if (responseCode == HttpURLConnection.HTTP_MOVED_PERM || responseCode == HttpURLConnection.HTTP_MOVED_TEMP) {
            String redirectUrl = con.getHeaderField("Location");
            if (redirectUrl != null) {
                // Follow the next hop recursively.
                return getFinalURL(redirectUrl);
            }
        }
        // Not a 301/302 (or no Location header): this is the final URL.
        return url;
    }

And use it like this:

    public static void main(String[] args) throws MalformedURLException, IOException {
        String fetchedUrl = getFinalURL("<your_url_here>");
        System.out.println("FetchedURL is:" + fetchedUrl);
    }
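
A recursive approach like this one keeps going as long as each response is a redirect, so a redirect cycle (A to B and back to A) would never terminate. The following is a minimal sketch of one possible guard, a set of already visited URLs. The class name `RedirectCycleGuard`, the method `finalUrl`, and the `nextLocation` lookup are hypothetical names introduced here for illustration; `nextLocation` stands in for the HTTP request and is assumed to return `null` when there is no redirect.

```java
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;
import java.util.function.Function;

final class RedirectCycleGuard {

    // Hypothetical helper: nextLocation stands in for the HTTP request and
    // returns the redirect target of a URL, or null when there is none.
    static String finalUrl(String start, Function<String, String> nextLocation) {
        Set<String> visited = new LinkedHashSet<>();
        String current = start;
        while (visited.add(current)) {       // add() returns false on a repeated URL
            String next = nextLocation.apply(current);
            if (next == null) {
                return current;              // no further redirect: done
            }
            current = next;
        }
        throw new IllegalStateException("Redirect cycle detected at " + current);
    }

    public static void main(String[] args) {
        // A made-up redirect table used instead of real network requests.
        Map<String, String> table = new HashMap<>();
        table.put("http://a.example/", "http://b.example/");
        table.put("http://b.example/", "http://c.example/");
        System.out.println(finalUrl("http://a.example/", table::get));
    }
}
```

In real use, the lookup would presumably wrap a single non-following request that returns the `Location` header for a redirect status and `null` otherwise.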

Answer 1 (score: 4)

    public static String getFinalRedirectedUrl(String url) {
        String finalUrl = url;
        try {
            while (true) {
                HttpURLConnection connection = (HttpURLConnection) new URL(finalUrl).openConnection();
                // Handle redirects manually so that every hop is visible.
                connection.setInstanceFollowRedirects(false);
                connection.setUseCaches(false);
                connection.setRequestMethod("GET");
                connection.connect();
                int responseCode = connection.getResponseCode();
                String redirectedUrl = connection.getHeaderField("Location");
                connection.disconnect();
                // Stop on any non-3xx status, or on a 3xx without a Location header.
                if (responseCode < 300 || responseCode >= 400 || redirectedUrl == null) {
                    break;
                }
                finalUrl = redirectedUrl;
                System.out.println("redirected url: " + finalUrl);
            }
        } catch (Exception e) {
            e.printStackTrace();
        }
        return finalUrl;
    }

Answer 2 (score: 2)

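My first idea would be to set instanceFollowRedirects to false, or to use URLConnection instead.

In both cases the redirect is not followed, so you receive the response to your original request. Get the HTTP status value and, if it is 3xx, get the new redirect value.

Of course there may be a chain of redirects, so you will probably need to iterate until you reach the real (status 2xx) page.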
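
The iteration described above could be sketched roughly as follows. This is a minimal sketch under stated assumptions, not a definitive implementation: the class name `RedirectFollower`, the helpers `isRedirect` and `resolveLocation`, and the `maxHops` limit are all names introduced here for illustration. The sketch also resolves a relative `Location` value against the current URL, a detail this thread does not cover.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.HttpURLConnection;
import java.net.MalformedURLException;
import java.net.URL;

final class RedirectFollower {

    // Assumption: only these statuses are treated as redirects
    // (304 Not Modified is a 3xx code but not a redirect).
    static boolean isRedirect(int status) {
        return status == 301 || status == 302 || status == 303
                || status == 307 || status == 308;
    }

    // Resolves a possibly relative Location value against the current URL.
    static String resolveLocation(String currentUrl, String location) {
        try {
            return new URL(new URL(currentUrl), location).toString();
        } catch (MalformedURLException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Follows redirects by hand, giving up after maxHops requests.
    static String followRedirects(String startUrl, int maxHops) throws IOException {
        String current = startUrl;
        for (int hop = 0; hop < maxHops; hop++) {
            HttpURLConnection con = (HttpURLConnection) new URL(current).openConnection();
            try {
                con.setInstanceFollowRedirects(false); // inspect each hop ourselves
                int status = con.getResponseCode();
                String location = con.getHeaderField("Location");
                if (!isRedirect(status) || location == null) {
                    return current; // not a redirect (or no Location header): stop here
                }
                current = resolveLocation(current, location);
            } finally {
                con.disconnect();
            }
        }
        throw new IOException("Too many redirects; last URL was " + current);
    }

    public static void main(String[] args) throws IOException {
        // Pass the URL to resolve as the first command-line argument.
        if (args.length > 0) {
            System.out.println(followRedirects(args[0], 10));
        }
    }
}
```

The hop limit is a design choice to keep a misconfigured server from causing an endless loop; the value 10 in `main` is arbitrary.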

Answer 3 (score: 1)

@user719950 On my Mac OS X, this solved the truncated HTTP URL problem:

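In your original code, just add the User-Agent line shown below. You have to find out, using your browser, which request headers IE / Chrome is sending. I still have no explanation for why this simple setting results in the correct URL :)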

    HttpURLConnection con = (HttpURLConnection) new URL("http://tinyurl.com/KindleWireless").openConnection();
    con.setInstanceFollowRedirects(true);
    con.setDoOutput(true);
    System.out.println("original url: " + con.getURL());
    // The added line: send a browser-like User-Agent header.
    con.setRequestProperty("User-Agent",
            "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_8_2) "
            + "AppleWebKit/536.26.17 (KHTML, like Gecko) Version/6.0.2 "
            + "Safari/536.26.17");

    con.connect();
    System.out.println("connected url: " + con.getURL());
    Thread.sleep(2000L);
    InputStream is = con.getInputStream();
    System.out.println("redirected url: " + con.getURL());

    is.close();

Answer 4 (score: 1)

This may help:

    public static void main(String[] args) throws MalformedURLException, IOException {
        // Assumption: `proxy` was not defined in the original snippet; use your own java.net.Proxy here.
        Proxy proxy = Proxy.NO_PROXY;
        HttpURLConnection con = (HttpURLConnection) new URL(
                "http://tinyurl.com/KindleWireless").openConnection(proxy);
        System.out.println("original url: " + con.getURL());
        // Disable automatic redirects before connecting.
        con.setInstanceFollowRedirects(false);
        con.connect();
        int responseCode = con.getResponseCode();
        if ((responseCode / 100) == 3) { // any 3xx status
            String newLocationHeader = con.getHeaderField("Location");
            System.out.println("Redirected Location " + newLocationHeader);
            System.out.println(responseCode);
        }
    }

Answer 5 (score: 0)

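@JEETS Your fetchRedirectURL function may not work because there are several HTTP status codes for redirects, not just 301. Change the check to a range check and it will work.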

    public static String fetchRedirectURL(String url) throws IOException {
        HttpURLConnection con = (HttpURLConnection) new URL(url).openConnection();
        con.setInstanceFollowRedirects(false);
        con.connect();

        // Treat every 3xx status as a redirect, not only 301.
        int responseCode = con.getResponseCode();
        if (responseCode >= 300 && responseCode < 400)
            return con.getHeaderField("Location");
        else
            return null;
    }

Answer 6 (score: 0)

This version recurses if there are multiple redirects:

    protected String getDirectUrl(String link) {
        String resultUrl = link;
        HttpURLConnection connection = null;
        try {
            connection = (HttpURLConnection) new URL(link).openConnection();
            connection.setInstanceFollowRedirects(false);
            connection.connect();
            int responseCode = connection.getResponseCode();
            if (responseCode == HttpURLConnection.HTTP_MOVED_PERM || responseCode == HttpURLConnection.HTTP_MOVED_TEMP) {
                String locationUrl = connection.getHeaderField("Location");

                if (locationUrl != null && locationUrl.trim().length() > 0) {
                    // Release this connection before following the next hop.
                    IOUtils.close(connection);
                    resultUrl = getDirectUrl(locationUrl);
                }
            }
        } catch (Exception e) {
            // `log` is the answerer's own helper, not shown here.
            log("error getDirectUrl", e);
        } finally {
            // IOUtils is presumably Apache Commons IO; close(URLConnection) disconnects an HttpURLConnection.
            IOUtils.close(connection);
        }
        return resultUrl;
    }