所以我从网站上检索一些JSON数据并使用urllib2中的标准urlopen():
代码:
url = 'http://api.nytimes.com/svc/politics/v3/us/legislative/congress/senate/votes/2011-12-14/2011-12-14.json?api-key=%s' % (api_key,)
print urlopen(url).read()
结果:
{
"status":"OK",
"copyright":"Copyright (c) 2011 The New York Times Company. All Rights Reserved.",
"results": {
"chamber": "Senate",
"start_date": "2011-12-14",
"end_date": "2011-12-14",
"num_results": "0",
"votes": [
]
}
}
这个结果是错误的。如果我手动将URL输入浏览器:
输入:
http://api.nytimes.com/svc/politics/v3/us/legislative/congress/senate/votes/2011-12-14/2011-12-14.json?api-key=a55a8988c8102a532221d2d465ca010a:1:65328279
我明白了:
{
"status":"OK",
"copyright":"Copyright (c) 2011 The New York Times Company. All Rights Reserved.",
"results": {
"chamber": "Senate",
"start_date": "2011-12-14",
"end_date": "2011-12-14",
"num_results": "1",
"votes": [
{
"congress": "112",
"session": "1",
"roll_call": "228",
"vote_uri":"http://api.nytimes.com/svc/politics/v3/us/legislative/congress/112/senate/sessions/1/votes/228.json",
"bill_number": "S.J.Res.24",
"question": "On the Joint Resolution",
"description": "Joint resolution proposing a balanced budget amendment to the Constitution of the United States.",
"vote_type": "1/2",
"date": "2011-12-14",
"time": "11:09:00",
"result": "Joint Resolution Defeated",
"democratic": {
"yes": "20",
"no": "31",
"present": "0",
"not_voting": "0",
"majority_position": "No"
},
"republican": {
"yes": "1",
"no": "46",
"present": "0",
"not_voting": "0",
"majority_position": "No"
},
"independent": {
"yes": "0",
"no": "2",
"present": "0",
"not_voting": "0"
},
"total": {
"yes": "21",
"no": "79",
"present": "0",
"not_voting": "0"
}
} ]
}
}
另外,当我使用CURL检索数据时,我得到:
$curl http://api.nytimes.com/svc/politics/v3/us/legislative/congress/senate/votes/2011-12-14/2011-12-14.json?api-key=a55a8988c8102a532221d2d465ca010a:1:65328279
{
"status":"OK",
"copyright":"Copyright (c) 2011 The New York Times Company. All Rights Reserved.",
"results": {
"chamber": "Senate",
"start_date": "2011-12-14",
"end_date": "2011-12-14",
"num_results": "2",
"votes": [
{
"congress": "112",
"session": "1",
"roll_call": "229",
"vote_uri": "http://api.nytimes.com/svc/politics/v3/us/legislative/congress/112/senate/sessions/1/votes/229.json",
"bill_number": "S.J.Res.10",
"question": "On the Joint Resolution",
"description": "Joint resolution proposing a balanced budget amendment to the Constitution of the United States.",
"vote_type": "2/3",
"date": "2011-12-14",
"time": "11:39:00",
"result": "Joint Resolution Defeated",
"democratic": {
"yes": "0",
"no": "51",
"present": "0",
"not_voting": "0",
"majority_position": "No"
},
"republican": {
"yes": "47",
"no": "0",
"present": "0",
"not_voting": "0",
"majority_position": "Yes"
},
"independent": {
"yes": "0",
"no": "2",
"present": "0",
"not_voting": "0"
},
"total": {
"yes": "47",
"no": "53",
"present": "0",
"not_voting": "0"
}
}, {
"congress": "112",
"session": "1",
"roll_call": "228",
"vote_uri": "http://api.nytimes.com/svc/politics/v3/us/legislative/congress/112/senate/sessions/1/votes/228.json",
"bill_number": "S.J.Res.24",
"question": "On the Joint Resolution",
"description": "Joint resolution proposing a balanced budget amendment to the Constitution of the United States.",
"vote_type": "1/2",
"date": "2011-12-14",
"time": "11:09:00",
"result": "Joint Resolution Defeated",
"democratic": {
"yes": "20",
"no": "31",
"present": "0",
"not_voting": "0",
"majority_position": "No"
},
"republican": {
"yes": "1",
"no": "46",
"present": "0",
"not_voting": "0",
"majority_position": "No"
},
"independent": {
"yes": "0",
"no": "2",
"present": "0",
"not_voting": "0"
},
"total": {
"yes": "21",
"no": "79",
"present": "0",
"not_voting": "0"
}
} ]
}
}
对于同一个网络请求,这是3个不同的结果。这是什么问题?是否有一些我忽略的缓存机制或者我错过了什么?
答案 0 :(得分:1)
您的请求中似乎必须设置“Accept-Encoding”标头。
以防万一,您可以像这样设置:
request = urllib2.Request(your_url)
request.add_header('Accept-Encoding',whatever_in_this_case)
opener = urllib2.build_opener()
print opener.open(request).read()