Python从包含字典的嵌套列表创建新列表

时间:2020-08-11 05:49:40

标签: python dictionary data-structures nested-lists

给出的是输入列表,其中包含字典。

Input=[
       {"rbd":"x","cat":"xxx","origin":"us"},
       {"rbd":"y","cat":"xxx","origin":"us"},
       {"rbd":"z","cat":"xxx","origin":"us"},
       {"rbd":"q","cat":"xxx","origin":"us"},
       {"rbd":"1","cat":"xxy","origin":"us"},
       {"rbd":"2","cat":"xxy","origin":"us"}
                                              ]

我正在尝试将其作为输出。 在这里,如果输入列表中字典的键“ cat”和键“ origin”相同,则将它们添加到一个“ rbd”中。否则它会创建一个新的。下面是输出。

Output=[{"rbd":["x","y","z","q"],"cat":"xxx","origin":"us"},
        {"rbd":["l","2"],"cat":"xxy","origin":"us"}]

我尝试了以下不完整的实现。首先将List转换成字典。会引发错误。

def Convert(lst): 
    Input_Dict = {lst[i]: lst[i + 1] for i in range(0, len(lst), 2)} 
    return Input_Dict

4 个答案:

答案 0 :(得分:4)

尝试itertools.groupby

from itertools import groupby

Input = [{"rbd": "x", "cat": "xxx", "origin": "us"},
         {"rbd": "y", "cat": "xxx", "origin": "us"},
         {"rbd": "z", "cat": "xxx", "origin": "us"},
         {"rbd": "q", "cat": "xxx", "origin": "us"},
         {"rbd": "1", "cat": "xxy", "origin": "us"},
         {"rbd": "2", "cat": "xxy", "origin": "us"}]

print([{"rbd": [i["rbd"] for i in item], "cat": k[0], "origin": k[1]} for k, item in groupby(Input, key=lambda x: (x["cat"], x["origin"]))])

这给了我

[{'rbd': ['x', 'y', 'z', 'q'], 'cat': 'xxx', 'origin': 'us'}, {'rbd': ['1', '2'], 'cat': 'xxy', 'origin': 'us'}]

答案 1 :(得分:1)

您应该遍历Input,并为每个元素检查cat和origin是否存在于Output中。如果已经存在,请将输入的rbd放在输出中。如果不存在,请在“输出”中创建一个新地图。

请注意,这不是最有效的答案。

答案 2 :(得分:1)

解决此难题的一种方法是通过defaultdict模块中的collections

from collections import defaultdict

d = defaultdict(list)

for entry in Input:
    d[(entry["cat"], entry["origin"])].append(entry["rbd"])

print(d)
defaultdict(list,
        {('xxx', 'us'): ['x', 'y', 'z', 'q'], ('xxy', 'us'): ['1', '2']})


keys = ("rbd", "cat", "origin")

#list comprehension to get final output
[dict(zip(keys, (value, *key))) for key, value in d.items()]

[{'rbd': ['x', 'y', 'z', 'q'], 'cat': 'xxx', 'origin': 'us'},
 {'rbd': ['1', '2'], 'cat': 'xxy', 'origin': 'us'}]

答案 3 :(得分:0)

一个人可以执行Itertools.groupby,如此处答案之一所述。 另一种方法是将字典列表转换为数据框,然后执行groupby,然后再转换回字典列表。

df = pd.DataFrame(Input)
df1 = df.groupby(by=['origin', 'cat'])['rbd'].apply(list).reset_index()
df_dicts = df1.to_dict("records")
print(df_dicts)