CURL和urlopen给出了相互矛盾的结果

时间:2011-12-14 18:25:27

标签: python curl urllib2 urlopen

所以我从网站上检索一些JSON数据并使用urllib2中的标准urlopen():

代码:

url = 'http://api.nytimes.com/svc/politics/v3/us/legislative/congress/senate/votes/2011-12-14/2011-12-14.json?api-key=%s' % (api_key,)
print urlopen(url).read()  

结果:

{
   "status":"OK",
   "copyright":"Copyright (c) 2011 The New York Times Company. All Rights Reserved.",
   "results": {
      "chamber": "Senate",
      "start_date": "2011-12-14",
      "end_date": "2011-12-14",
      "num_results": "0",
      "votes": [
                 ]
    }
}

这个结果是错误的。如果我手动将URL输入浏览器:

输入:

http://api.nytimes.com/svc/politics/v3/us/legislative/congress/senate/votes/2011-12-14/2011-12-14.json?api-key=a55a8988c8102a532221d2d465ca010a:1:65328279

我明白了:

{
   "status":"OK",
   "copyright":"Copyright (c) 2011 The New York Times Company. All Rights Reserved.",
   "results": {
      "chamber": "Senate",
      "start_date": "2011-12-14",
      "end_date": "2011-12-14",
      "num_results": "1",
  "votes": [
            {
       "congress": "112",
       "session": "1",
       "roll_call": "228",
"vote_uri":"http://api.nytimes.com/svc/politics/v3/us/legislative/congress/112/senate/sessions/1/votes/228.json",
       "bill_number": "S.J.Res.24",
       "question": "On the Joint Resolution", 
       "description": "Joint resolution proposing a balanced budget amendment to the Constitution of the United States.",
       "vote_type": "1/2",
       "date": "2011-12-14",
       "time": "11:09:00",
       "result": "Joint Resolution Defeated",
       "democratic": {
         "yes": "20",
         "no": "31",
         "present": "0",
         "not_voting": "0",
         "majority_position": "No"
        },
       "republican": {
         "yes": "1",
         "no": "46",
         "present": "0",
         "not_voting": "0",
         "majority_position": "No"
        },
       "independent": {
         "yes": "0",
         "no": "2",
         "present": "0",
         "not_voting": "0"
        },
       "total": {
         "yes": "21",
         "no": "79",
         "present": "0",
         "not_voting": "0"
        }
      }                 ]
}

}

另外,当我使用CURL检索数据时,我得到:

$curl http://api.nytimes.com/svc/politics/v3/us/legislative/congress/senate/votes/2011-12-14/2011-12-14.json?api-key=a55a8988c8102a532221d2d465ca010a:1:65328279
{
   "status":"OK",
   "copyright":"Copyright (c) 2011 The New York Times Company. All Rights Reserved.",
   "results": {
      "chamber": "Senate",
      "start_date": "2011-12-14",
      "end_date": "2011-12-14",
      "num_results": "2",
       "votes": [
            {
       "congress": "112",
       "session": "1",
       "roll_call": "229",
       "vote_uri": "http://api.nytimes.com/svc/politics/v3/us/legislative/congress/112/senate/sessions/1/votes/229.json",
       "bill_number": "S.J.Res.10",
       "question": "On the Joint Resolution", 
       "description": "Joint resolution proposing a balanced budget amendment to the Constitution of the United States.",
       "vote_type": "2/3",
       "date": "2011-12-14",
       "time": "11:39:00",
       "result": "Joint Resolution Defeated",
       "democratic": {
         "yes": "0",
         "no": "51",
         "present": "0",
         "not_voting": "0",
         "majority_position": "No"
        },
       "republican": {
         "yes": "47",
         "no": "0",
         "present": "0",
         "not_voting": "0",
         "majority_position": "Yes"
        },
       "independent": {
         "yes": "0",
         "no": "2",
         "present": "0",
         "not_voting": "0"
        },
       "total": {
         "yes": "47",
         "no": "53",
         "present": "0",
         "not_voting": "0"
        }
      },                {
       "congress": "112",
       "session": "1",
       "roll_call": "228",
       "vote_uri": "http://api.nytimes.com/svc/politics/v3/us/legislative/congress/112/senate/sessions/1/votes/228.json",
       "bill_number": "S.J.Res.24",
       "question": "On the Joint Resolution", 
       "description": "Joint resolution proposing a balanced budget amendment to the Constitution of the United States.",
       "vote_type": "1/2",
       "date": "2011-12-14",
       "time": "11:09:00",
       "result": "Joint Resolution Defeated",
       "democratic": {
         "yes": "20",
         "no": "31",
         "present": "0",
         "not_voting": "0",
         "majority_position": "No"
        },
       "republican": {
         "yes": "1",
         "no": "46",
         "present": "0",
         "not_voting": "0",
         "majority_position": "No"
        },
       "independent": {
         "yes": "0",
         "no": "2",
         "present": "0",
         "not_voting": "0"
        },
       "total": {
         "yes": "21",
         "no": "79",
         "present": "0",
         "not_voting": "0"
        }
      }                 ]
}

}

对于同一个网络请求,这是3个不同的结果。这是什么问题?是否有一些我忽略的缓存机制或者我错过了什么?

1 个答案:

答案 0 :(得分:1)

您的请求中似乎必须设置“Accept-Encoding”标头。

以防万一,您可以像这样设置:

request = urllib2.Request(your_url)
request.add_header('Accept-Encoding',whatever_in_this_case)
opener = urllib2.build_opener()
print opener.open(request).read()