How do I extract data from this JSON using BeautifulSoup and Python?

Asked: 2016-03-23 14:08:59

Tags: python django beautifulsoup

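How can I get the two values utc_last_updated and name from the following JSON? I used requests to fetch the content and then BeautifulSoup to get it into its current form. Now I only want to extract those two values.

{
    "data": [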
        {
            "scm": "hg",
            "has_wiki": false,
            "last_updated": "2016-03-23T14:05:27.433",
            "no_forks": false,
            "created_on": "2016-03-18T22:55:52.705",
            "owner": "user",
            "email_mailinglist": "",
            "is_mq": false,
            "size": 420034,
            "read_only": false,
            "fork_of": null,
            "mq_of": null,
            "state": "available",
            "utc_created_on": "2016-03-18 21:55:52+00:00",
            "website": "",
            "description": "",
            "has_issues": false,
            "is_fork": false,
            "slug": "store",
            "is_private": true,
            "name": "store",
            "language": "python",
            "utc_last_updated": "2016-03-23 13:05:27+00:00",
            "no_public_forks": true,
            "creator": null,
            "resource_uri": "/1.0/repositories/my_url"
        },
        {
            "scm": "hg",
            "has_wiki": false,
            "last_updated": "2016-03-18T12:26:22.261",
            "no_forks": false,
            "created_on": "2016-03-18T12:19:08.262",
            "owner": "user",
            "email_mailinglist": "",
            "is_mq": false,
            "size": 173137,
            "read_only": false,
            "fork_of": null,
            "mq_of": null,
            "state": "available",
            "utc_created_on": "2016-03-18 11:19:08+00:00",
            "website": "",
            "description": "",
            "has_issues": false,
            "is_fork": false,
            "name": "foo",
            "is_private": true,
            "language": "python",
            "utc_last_updated": "2016-03-18 11:26:22+00:00",
            "no_public_forks": true,
            "creator": null,
            "resource_uri": "/1.0/repositories/my_rl"
        }
    ]
}

I would greatly appreciate any help.

1 answer:

Answer 0: (score: 4)

You have a JSON response, not HTML, so parse it with the json module instead of BeautifulSoup:

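import json

# `response` is assumed to be the raw JSON text (e.g. `response.text` from requests).
data = json.loads(response)
for item in data["data"]:
    # Print the two fields the question asks for.
    print(item["name"], item["utc_last_updated"])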
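
For a version that runs on its own, here is a minimal sketch of the same approach. The `SAMPLE` string and the `extract_fields` helper are hypothetical names made up for illustration. The sample keeps only the two fields of interest and assumes the response is a single object with a top-level "data" list, as the question's excerpt suggests.

```python
import json

# Hypothetical sample mirroring the structure shown in the question,
# trimmed to the two fields of interest.
SAMPLE = """
{
    "data": [
        {"name": "store", "utc_last_updated": "2016-03-23 13:05:27+00:00"},
        {"name": "foo", "utc_last_updated": "2016-03-18 11:26:22+00:00"}
    ]
}
"""


def extract_fields(raw_json):
    """Return a list of (name, utc_last_updated) tuples from raw JSON text."""
    payload = json.loads(raw_json)
    # .get() yields None instead of raising KeyError when a key is absent.
    return [
        (item.get("name"), item.get("utc_last_updated"))
        for item in payload.get("data", [])
    ]


if __name__ == "__main__":
    for name, updated in extract_fields(SAMPLE):
        print(name, updated)
```

If the content was fetched with requests, calling `response.json()` on the response object should give the same parsed structure as `json.loads(response.text)`, so the BeautifulSoup step can probably be dropped entirely.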