文件由curl下载但不是由node.js

时间:2017-04-20 10:59:40

标签: node.js powershell http curl http-headers

所以我试图通过在浏览器中打开的nodejs下载文件,甚至可以在curl等工具中下载。 但是由于某种原因,nodejs因为下载文件而失败。我尝试通过节点中的request模块和名为download-cli的节点cli模块下载文件。他们两个都失败了400或404响应,但文件通过像curl这样的常规工具下载。

可能是什么问题?我已经尝试将用户代理设置为Firefox的用户代理(它打开就好了),但这并不能解决问题。我假设问题不是关于用户代理,因为curl没有自己的用户代理。

有问题的网址可以是alicdn的任何网址,但我们以此为例: https://ae01.alicdn.com/kf/HTB1ftVmPVXXXXXUXVXXq6xXFXXXG/Langtek-smart-watch-gt12-часы-поддержка-синхронизации-notifier-sim-карты-подключение-bluetooth-для-android-apple-iphone.jpg_640x640.jpg

通过在PowerShell中使用节点download-cli工具和Invoke-WebRequest工具运行上述网址来获得响应。

PS C:\code> download https://ae01.alicdn.com/kf/HTB1ftVmPVXXXXXUXVXXq6xXFXXXG/Langtek-smart-watch-gt12-часы-поддержка-син
хронизации-notifier-sim-карты-подключение-bluetooth-для-android-apple-iphone.jpg_640x640.jpg
Couldn't connect to https://ae01.alicdn.com/kf/HTB1ftVmPVXXXXXUXVXXq6xXFXXXG/Langtek-smart-watch-gt12-часы-поддержка-синхронизации-notifier-sim-карты-подключение-bluetooth-для-android-apple-iphone.jpg_640x640.jpg (404)
PS C:\code> curl https://ae01.alicdn.com/kf/HTB1ftVmPVXXXXXUXVXXq6xXFXXXG/Langtek-smart-watch-gt12-часы-поддержка-синхрон
изации-notifier-sim-карты-подключение-bluetooth-для-android-apple-iphone.jpg_640x640.jpg


StatusCode        : 200
StatusDescription : OK
Content           : {255, 216, 255, 224...}
RawContent        : HTTP/1.1 200 OK
                    X-Application-Context: fileserver2-download:prod:7001
                    From-Req-Dns-Type: NA,NA
                    SERVED-FROM: 72.247.178.95
                    Connection: keep-alive
                    Network_Info: DE_FRANKFURT_16509
                    Timing-Allow-Ori...
Headers           : {[X-Application-Context, fileserver2-download:prod:7001], [From-Req-Dns-Type, NA,NA], [SERVED-FROM, 72.247.178.95],
                    [Connection, keep-alive]...}
RawContentLength  : 114927

2 个答案:

答案 0 :(得分:0)

好的,所以我尝试通过节点的本机http模块下载文件,我尝试通过流行的request模块下载,我尝试通过一个名为download-cli的基于节点的cli工具下载。他们每个人都有同样的反应。 所以我启动了Wireshark,并试图确切地看到请求的不同之处,结果发现像curlInvoke-WebRequest之类的工具在发出GET请求之前逃脱了路径但是节点的本机模块没有这样做。这是唯一的区别。使用转义的网址可以正常工作。

Invoke-WebRequest的GET路径:

GET /kf/HTB1ftVmPVXXXXXUXVXXq6xXFXXXG/Langtek-smart-watch-gt12-%D1%87%D0%B0%D1%81%D1%8B-%D0%BF%D0%BE%D0%B4%D0%B4%D0%B5%D1%80%D0%B6%D0%BA%D0%B0-%D1%81%D0%B8%D0%BD%D1%85%D1%80%D0%BE%D0%BD%D0%B8%D0%B7%D0%B0%D1%86%D0%B8%D0%B8-notifier-sim-%D0%BA%D0%B0%D1%80%D1%82%D1%8B-%D0%BF%D0%BE%D0%B4%D0%BA%D0%BB%D1%8E%D1%87%D0%B5%D0%BD%D0%B8%D0%B5-bluetooth-%D0%B4%D0%BB%D1%8F-android-apple-iphone.jpg_640x640.jpg HTTP/1.1

节点的GET路径:

GET /kf/HTB1ftVmPVXXXXXUXVXXq6xXFXXXG/Langtek-smart-watch-gt12-G0AK-?>445@6:0-A8=E@>=870F88-notifier-sim-:0@BK-?>4:;NG5=85-bluetooth-4;O-android-apple-iphone.jpg_640x640.jpg HTTP/1.1

答案 1 :(得分:0)

为什么你没有这样做:

$url='https://ae01.alicdn.com/kf/HTB1ftVmPVXXXXXUXVXXq6xXFXXXG/Langtek-smart-watch-gt12-%D1%87%D0%B0%D1%81%D1%8B-%D0%BF%D0%BE%D0%B4%D0%B4%D0%B5%D1%80%D0%B6%D0%BA%D0%B0-%D1%81%D0%B8%D0%BD%D1%85%D1%80%D0%BE%D0%BD%D0%B8%D0%B7%D0%B0%D1%86%D0%B8%D0%B8-notifier-sim-%D0%BA%D0%B0%D1%80%D1%82%D1%8B-%D0%BF%D0%BE%D0%B4%D0%BA%D0%BB%D1%8E%D1%87%D0%B5%D0%BD%D0%B8%D0%B5-bluetooth-%D0%B4%D0%BB%D1%8F-android-apple-iphone.jpg_640x640.jpg'
Invoke-WebRequest -Uri $url -OutFile C:\temp\android-apple-iphone.jpg_640x640.jpg