PHP cURL导致403 Forbidden

时间:2016-04-11 17:59:38

标签: php curl http-status-code-403

我正在尝试cURL导致403 Forbidden错误的远程站点。从同一台服务器,我可以通过终端运行以下两个命令。第一个失败而第二个失败。如何让我的PHP代码与第二个终端命令相匹配?

此终端命令不会返回任何内容。

curl http://www.barneys.com

此终端命令会产生正确的结果(页面的HTML)

curl -L http://www.barneys.com

我的PHP代码:

$ch = curl_init('http://www.barneys.com');
$http_headers = array(
'User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:40.0)' . 'Gecko/20100101 Firefox/40.0',
'Accept: */*',
'X-Requested-With: XMLHttpRequest',
'Referer: http://www.barneys.com', # IMPORTANT
'Accept-Language: pt-BR,en-US;q=0.7,en;q=0.3',
);
curl_setopt($ch, CURLOPT_HTTPHEADER, $http_headers);
curl_setopt($ch, CURLOPT_USERAGENT,'Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.13) Gecko/20080311 Firefox/2.0.0.13');
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_AUTOREFERER, true);
curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, false);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
curl_setopt($ch, CURLOPT_REFERER, 'http://www.barneys.com');
curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, 0); 
curl_setopt($ch, CURLOPT_TIMEOUT, 60);
$response = curl_exec($ch);
$redirectURL = curl_getinfo($ch,CURLINFO_EFFECTIVE_URL);
curl_close($ch);

echo $response;

编辑:以下是来自cURL请求的PHP日志:

* About to connect() to www.barneys.com port 80 (#0)
*   Trying 23.204.27.110... * connected
* Connected to www.barneys.com (23.204.27.110) port 80 (#0)
> GET / HTTP/1.1
Host: www.barneys.com
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:40.0)Gecko/20100101 Firefox/40.0
Accept: */*
X-Requested-With: XMLHttpRequest
Referer: http://www.barneys.com
Accept-Language: pt-BR,en-US;q=0.7,en;q=0.3
Connection: keep-alive

< HTTP/1.1 403 Forbidden
< Server: AkamaiGHost
< Mime-Version: 1.0
< Content-Type: text/html
< Content-Length: 265
< Expires: Mon, 11 Apr 2016 23:22:16 GMT
< Date: Mon, 11 Apr 2016 23:22:16 GMT
< Connection: close
< 
* Closing connection #0

1 个答案:

答案 0 :(得分:2)

您的php cURL请求中不包含请求标头。要解决此问题,请在设置CURLOPT_HTTPHEADER选项之前添加以下行:

curl_setopt($ch, CURLOPT_HEADER, true);

来自PHP curl docs

  

CURLOPT_HEADER TRUE,在输出中包含标题。

此外,如果您向网址添加尾部斜杠,则不需要通过cURL重建网址。您的许多代码似乎都是不必要的。这是一个精简的工作示例:

<?php
$ch = curl_init('http://www.barneys.com/');
$http_headers = array(
    'User-Agent: Junk', // Any User-Agent will do here
);
curl_setopt($ch, CURLOPT_HEADER, true);
curl_setopt($ch, CURLOPT_HTTPHEADER, $http_headers);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
$response = curl_exec($ch);
curl_close($ch);

echo $response;