Site redirects to a new address but does not send a 3XX code

I am trying to crawl the site http://www.daxontech.com. If we enter this address into a web browser, it automatically changes to http://www.benqmaterials.com/index.aspx.

I tried to use the response code to make my program follow the redirect, but the response code is 200. Is there a way to get the new address? I use Jsoup to crawl. Thanks in advance.

+4
2 answers

The site you specified uses a meta-redirect that occurs after the headers are sent to the client.

<meta http-equiv="refresh" content="0; url=http://www.BenQMaterials.com/index.aspx">     

To follow it, you have to read the target URL from the html (the response body) yourself.

+3
$ curl "http://www.daxontech.com" -i
HTTP/1.1 200 OK
Content-Type: text/html
Last-Modified: Tue, 04 Jan 2011 08:47:58 GMT
Accept-Ranges: bytes
ETag: "61cc416ecabcb1:0"
Server: Microsoft-IIS/7.5
X-Powered-By: ASP.NET
Date: Tue, 07 Jan 2014 10:11:11 GMT
Content-Length: 304

<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=big5">
<meta http-equiv="refresh" content="0; url=http://www.BenQMaterials.com/index.aspx">      
<title>BenQ Materials, the top four manufacturers of polarizer in the world.</title>
</head>
<body>
</body>
</html>

As you can see, the page contains:

<meta http-equiv="refresh" content="0; url=http://www.BenQMaterials.com/index.aspx">

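The http-equiv attribute provides an HTTP header for the information/value of the content attribute.

The http-equiv attribute can be used to simulate an HTTP response header.

refresh: defines a time interval for the document to refresh itself. Example: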

<meta http-equiv="refresh" content="300">

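Note: the value "refresh" should be used carefully, as it takes control of the page away from the user. Using "refresh" will cause a failure in W3C's Web Content Accessibility Guidelines.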

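So, to find the redirect target, parse the HTML, for example with the XPath expression "//meta[@http-equiv='refresh']" or with a regex that looks for <meta http-equiv="refresh", and take the URL from the content attribute.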
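
The regex route could look roughly like the sketch below. It uses only the Java standard library and assumes the page body has already been downloaded into a string. The class name MetaRefresh and the method extractUrl are made up for this illustration, and the patterns only cover the common "seconds; url=..." form of the content attribute, so treat it as a starting point and not as a complete HTML parser.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of Jsoup or any other library.
final class MetaRefresh {
    // Finds a <meta ...> tag whose http-equiv value is "refresh" (case-insensitive).
    private static final Pattern META_TAG = Pattern.compile(
            "<meta\\b[^>]*http-equiv\\s*=\\s*[\"']?refresh[\"']?[^>]*>",
            Pattern.CASE_INSENSITIVE);

    // Captures whatever follows "url=" inside the content attribute of that tag.
    private static final Pattern URL_PART = Pattern.compile(
            "content\\s*=\\s*[\"'][^\"']*?url\\s*=\\s*[\"']?([^\"'\\s>]+)",
            Pattern.CASE_INSENSITIVE);

    // Returns the redirect target, or empty if the page has no meta refresh with a URL.
    static Optional<String> extractUrl(String html) {
        Matcher tag = META_TAG.matcher(html);
        while (tag.find()) {
            Matcher url = URL_PART.matcher(tag.group());
            if (url.find()) {
                return Optional.of(url.group(1));
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // Same head section as in the curl output above.
        String html = "<html><head>"
                + "<meta http-equiv=\"Content-Type\" content=\"text/html; charset=big5\">"
                + "<meta http-equiv=\"refresh\" content=\"0; url=http://www.BenQMaterials.com/index.aspx\">"
                + "</head><body></body></html>";
        System.out.println(extractUrl(html).orElse("no meta refresh found"));
    }
}
```

A tag such as <meta http-equiv="refresh" content="300"> has no url part, so extractUrl returns an empty Optional for it. With Jsoup, the same tag can probably be selected with doc.select("meta[http-equiv=refresh]") and its content attribute read with attr("content"), after which only the url= part needs to be split off.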

+1
