Python中的自定义JSON sort_keys顺序

时间:2010-12-09 19:54:29

标签: python json sorting

Python 2.6中有没有办法向JSON的sort_keys提供自定义键或cmp函数?

我有一个来自JSON的词典列表,如下所示:

[
  {
    "key": "numberpuzzles1",
    "url": "number-puzzle-i.html",
    "title": "Number Puzzle I",
    "category": "nestedloops",
    "points": "60",
    "n": "087"
  },
  {
     "key": "gettingindividualdigits",
     "url": "getting-individual-digits.html",
     "title": "Getting Individual Digits",
     "category": "nestedloops",
     "points": "80",
     "n": "088"
  }
]

...我已将其存储到列表变量assigndb中。我希望能够加载JSON,修改它,并使用dumps(或其他)将其序列化,保持密钥的顺序

到目前为止,我尝试过这样的事情:

ordering = {'key': 0, 'url': 1, 'title': 2, 'category': 3,
             'flags': 4, 'points': 5, 'n': 6}

def key_func(k):
    return ordering[k]

# renumber assignments sequentially
for (i, a) in enumerate(assigndb):
    a["n"] = "%03d" % (i+1)

s = json.dumps(assigndb, indent=2, sort_keys=True, key=key_func)

...但当然dumps不支持list.sort()之类的自定义键。可能有自定义JSONEncoder的东西?我似乎无法实现目标。

5 个答案:

答案 0 :(得分:12)

一个想法(用2.7测试):

import json
import collections
json.encoder.c_make_encoder = None
d = collections.OrderedDict([("b", 2), ("a", 1)])
json.dumps(d)
# '{"b": 2, "a": 1}'

请参阅:OrderedDict + issue6105。 Python 2.x似乎只需要c_make_encoder hack。不是直接的解决方案,因为您必须更改dict的{​​{1}} s,但它仍然可用。我检查了json库(encode.py),并且有序是硬编码的:

OrderedDict

答案 1 :(得分:4)

这有点难看,但是如果tokland的解决方案不适合你:

data = [{'category': 'nestedloops', 'title': 'Number Puzzle I', 'url': 'number-puzzle-i.html', 'n': '087', 'points': '60', 'key': 'numberpuzzles1'}, {'category': 'nestedloops', 'title': 'Getting Individual Digits', 'url': 'getting-individual-digits.html', 'n': '088', 'points': '80', 'key': 'gettingindividualdigits'}]
ordering = {'key': 0, 'url': 1, 'title': 2, 'category': 3,
            'flags': 4, 'points': 5, 'n': 6}
outlist = []
for d in data:
    outlist.append([])
    for k in sorted(d.keys(), key=lambda k: ordering[k]):
        outlist[-1].append(json.dumps({k: d[k]}))

for i, l in enumerate(outlist):
    outlist[i] = "{" + ",".join((s[1:-1] for s in outlist[i])) + "}"

s = "[" + ",".join(outlist) + "]"

答案 2 :(得分:0)

我遇到了同样的问题而且collections.OrderedDict不适合该任务,因为它按字母顺序排序所有内容。所以我写了一些类似于Andrew Clark的解决方案:

def json_dumps_sorted(data, **kwargs):
    sorted_keys = kwargs.get('sorted_keys', tuple())
    if not sorted_keys:
        return json.dumps(data)
    else:
        out_list = []
        for element in data:
            element_list = []
            for key in sorted_keys:
                if key in element:
                    element_list.append(json.dumps({key: element[key]}))
            out_list.append('{{{}}}'.format(','.join((s[1:-1] for s in element_list))))
        return '[{}]'.format(','.join(out_list))

你这样使用它:

json_string = json_dumps_sorted([
    {
        "key": "numberpuzzles1",
        "url": "number-puzzle-i.html",
        "title": "Number Puzzle I",
        "category": "nestedloops",
        "points": "60",
        "n": "087"
    }, {
        "key": "gettingindividualdigits",
        "url": "getting-individual-digits.html",
        "title": "Getting Individual Digits",
        "category": "nestedloops",
        "points": "80",
        "n": "088"
    }
], sorted_keys=(
    'key', 
    'url', 
    'title', 
    'category',
    'flags', 
    'points', 
    'n'
))

答案 3 :(得分:0)

具有“前置”和“附加”键的紧凑而强大的递归实现:https://gist.github.com/jeromerg/91f73d5867c5fa04ee7dbc0c5a03d611

def sort_recursive(node, first_keys, last_keys):
    """ Sort the dictionary entries in a whole JSON object tree"""
    fixed_placements = {
        **{key: (0, idx) for idx, key in enumerate(first_keys)},
        **{key: (2, idx) for idx, key in enumerate(last_keys)},
    }
    return _sort_recursive(node, lambda key: fixed_placements.get(key, (1, key)))


def _sort_recursive(node, key_fn):
    if isinstance(node, list):
        return [_sort_recursive(val, key_fn) for val in node]
    elif isinstance(node, dict):
        sorted_keys = sorted(node.keys(), key=key_fn)
        return {k:_sort_recursive(node[k], key_fn) for k in sorted_keys}
    else:
        return node

答案 4 :(得分:-1)

感谢。我需要设置一个时间戳键:值在我的JSON对象的顶部,无论如何。显然,排序键将其拧紧,因为它以" t"。

开头

使用类似的东西,同时立即将时间戳键放在dict_data中:

d = collections.OrderedDict(dict_data)