我想问一下,如何使用熊猫将这个json数据集标准化?
{
"customer_id": "C23",
"basket": [{
"product_id": "P32",
"price": 1173
}, {
"product_id": "P27",
"price": 1943
}],
"date_of_purchase": "2018-12-01 23:34:00"
}
我希望输出是这样的:
| customer_id | product_id | price | date_of_purchase |
答案 0 :(得分:1)
使用pd.io.json.json_normalize
指定您的meta
和record_paths
参数
pd.io.json.json_normalize(json_txt, 'basket', ['customer_id', 'date_of_purchase'])
输出
price product_id customer_id date_of_purchase
0 1173 P32 C23 2018-12-01 23:34:00
1 1943 P27 C23 2018-12-01 23:34:00