我正在使用Python 2.7.6来解析JSON文件,但是我收到了一个错误,我不确定为什么。这是我第一次处理Python,所以这可能是一个非常基本的问题,但我看了一下堆栈溢出,似乎无法弄清楚是什么问题。
这是我解析数据的python代码:
import json
from pprint import pprint
with open ('test.json') as data_file:
data = json.load(data_file);
pprint(data)
这是我的JSON文件:
{"votes": {"funny": 0, "useful": 0, "cool": 0}, "user_id": "hckr9Hf8BUHcXfOSDv9eJA", "review_id": "K6EEJo0I8AbwGWvwe5SJYQ", "stars": 5, "date": "2013-05-05", "text": "This place is fantastic. they have a restaurant inside the grocery store. very good food.", "type": "review", "business_id": "uPezkdNi_x_SwWlf_2rcMw"}
{"votes": {"funny": 0, "useful": 0, "cool": 1}, "user_id": "PK3TxomYLwZuOXonmYqjNw", "review_id": "5ivy-tczAQ4WYrmVF6YoKg", "stars": 5, "date": "2013-08-11", "text": "This is going to be a place we go back to many times!", "type": "review", "business_id": "UB2j_EV3CIM_E4LcpadKMQ"}
这是我解析JSON时遇到的错误:
File "./parse.py", line 6, in <module>
data = json.load(data_file);
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/json/__init__.py", line 290, in load
**kw)
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/json/__init__.py", line 338, in loads
return _default_decoder.decode(s)
File "/System/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/json/decoder.py", line 368, in decode
raise ValueError(errmsg("Extra data", s, end, len(s)))
ValueError: Extra data: line 2 column 1 - line 3 column 1 (char 318 - 600)
奇怪的是,如果我只有第一行JSON,我可以成功解析数据。任何帮助理解错误并帮助我解决它将非常感激。
答案 0 :(得分:3)
这不是有效的json - 你不能只是将两个哈希紧挨着彼此......尝试这样的事情。你会注意到我在它之间放了一个逗号并将整个哈希值放在一个数组中。
[
{
"stars" : 5,
"date" : "2013-05-05",
"review_id" : "K6EEJo0I8AbwGWvwe5SJYQ",
"text" : "This place is fantastic. they have a restaurant inside the grocery store. very good food.",
"user_id" : "hckr9Hf8BUHcXfOSDv9eJA",
"type" : "review",
"votes" : {
"funny" : 0,
"cool" : 0,
"useful" : 0
},
"business_id" : "uPezkdNi_x_SwWlf_2rcMw"
},
{
"stars" : 5,
"date" : "2013-08-11",
"review_id" : "5ivy-tczAQ4WYrmVF6YoKg",
"text" : "This is going to be a place we go back to many times!",
"user_id" : "PK3TxomYLwZuOXonmYqjNw",
"type" : "review",
"votes" : {
"funny" : 0,
"cool" : 1,
"useful" : 0
},
"business_id" : "UB2j_EV3CIM_E4LcpadKMQ"
}
]