从列表python中删除重复项

时间:2015-12-11 04:33:07

标签: python

我有两个清单:

list1 = [IM12345, IM12346, IM12347, IM12348]
list2 = [ID300, ID404, ID300, ID601]

list2与相应的list1值相关联。 list1具有唯一值,其中list2具有重复项。

我希望list2唯一对应的关联值将添加到list2值。

Dict= {ID300: {IM12345, IM12347}, ID404: IM12346, ID601: IM12348}

上面的模式可以是列表,集合或字典。

我应该使用python中的哪个算法来获得上述结果?

5 个答案:

答案 0 :(得分:3)

您可以尝试collections.defaultdict

from collections import defaultdict
d = defaultdict(set)

list1 = ['IM12345', 'IM12346', 'IM12347', 'IM12348']
list2 = ['ID300', 'ID404', 'ID300', 'ID601']

for key, value in zip(list2, list1):
    d[key].add(value)

演示:

>>> d
defaultdict(<class 'set'>, {'ID300': {'IM12345', 'IM12347'}, 'ID404': {'IM12346'}, 'ID601': {'IM12348'}})
>>>
>>>
>>> for i, j in d.items():
...     print(i, j)
...     
... 
ID601 {'IM12348'}
ID300 {'IM12345', 'IM12347'}
ID404 {'IM12346'}

答案 1 :(得分:1)

您可以创建一个dict来保存数据集

list1 = ["IM12345", "IM12346", "IM12347", "IM12348"]
list2 = ["ID300", "ID404", "ID300", "ID601"]

dictResult=dict()
i=0
for item in list2:
    print item
    if dictResult.has_key(item):
        dictResult[item].append(list1[i])
    else:
        dictResult[item]=[list1[i]]
    i=i+1

print dictResult

结果:

{'ID404': ['IM12346'], 'ID300': ['IM12345', 'IM12347'], 'ID601': ['IM12348']}

答案 2 :(得分:0)

可能不完全是一种pythonic方式但是 - 在这里: 映射输入:

map = dict(zip(list1, list2))

现在你可以进行逆映射:

inv_map = {}
for k, v in map.iteritems():
    inv_map[v] = inv_map.get(v, [])
    inv_map[v].append(k)

以上示例的结果:

>>> inv_map
{'ID404': ['IM12346'], 'ID300': ['IM12345', 'IM12347'], 'ID601': ['IM12348']}

答案 3 :(得分:0)

另一种方法是使用列表操作。

yourList = ["1","2","3","4","1","2"] 
newList = [] 
for f in yourList:
    if f not in newList:
       newList.append(f)

答案 4 :(得分:0)

简单的解决方案

from collections import defaultdict

list1 = ['IM12345', 'IM12346', 'IM12347', 'IM12348']
list2 = ['ID300', 'ID404', 'ID300', 'ID601']

d = defaultdict(list)

for n in range(len(list2)):
    d[list2[n]].append(list1[n])

print d.items()

结果:

[('ID404', ['IM12346']), ('ID300', ['IM12345', 'IM12347']), ('ID601', ['IM12348'])]

Python2.7 Documentation----defaultdict