使用collections.Counter

时间:2018-10-22 22:02:10

标签: python list nested counter element

我有一个这样的列表

L = [['16.37.123.153','119.222.456.130','38673','161','17','62','4646'],
 ['16.37.456.153','119.222.123.112','56388','161','17','62','4646'],..]

我想通过collections.counter获得该列表中出现次数最多的5个元素。

当我使用此解决方案时:

mostfrequentelements, counterofelements = zip(*Counter(L).most_common(5))

我得到了错误:

Traceback (most recent call last):
  File "hypergraph.py", line 65, in <module>
    mostfrequent, countermfi = zip(*Counter(L).most_common(5)) 
  File "/usr/lib/python2.7/collections.py", line 477, in __init__
    self.update(*args, **kwds)
  File "/usr/lib/python2.7/collections.py", line 567, in update
    self[elem] = self_get(elem, 0) + 1
TypeError: unhashable type: 'list'

如何将其用于列表类型?我需要一种具有最佳时间复杂度的解决方案。

输入:

L = [['16.37.123.153','119.222.456.130','38673','161','17','62','4646'],
     ['16.37.456.153','119.222.123.112','56388','161','17','62','4646'],..]

输出:

Mostfrequentlelements = list of 5 most occuring sublists
Counter = list of the occurence counts of the 5 sublists

预先感谢您, 问候:)

1 个答案:

答案 0 :(得分:0)

您正在传递嵌套列表而不是序列。使用计数器之前,您需要访问列表L中的每个列表。

您可以创建一个计数器列表,以后再获取值。

L = [['16.37.123.153','119.222.456.130','38673','161','17','62','4646'],
 ['16.37.456.153','119.222.123.112','56388','161','17','62','4646']]

from collections import Counter

c = [Counter(i) for i in L]

for item in c:
    print(item.most_common(5))

更新

要计算每个列表,可以将它们连接到字符串并计数:

L = [['16.37.123.153','119.222.456.130','38673','161','17','62','4646'],
 ['16.37.456.153','119.222.123.112','56388','161','17','62','4646']]

from collections import Counter

L = [','.join(i) for i in L]
c = Counter(L).most_common(5)

Mostfrequentlelements = [i[0] for i in c]
Counts = [i[1] for i in c]