Question

我正在进行一些聚类，这会产生一个像这样的字符串列表：

['5-3-2', '5-3-2', '4-3-2-1', ...]

我想根据字符串的频率绘制条形图。是否有捷径可寻？我想我可以识别列表中的独特元素并计算它们，但也许有一个更舒适的解决方案？

编辑：更多信息

    import matplotlib.pyplot as plt
    import numpy as np
    import math as math
    import Utils as ut
    from sklearn.cluster import KMeans
    from itertools import cycle

...
    result = np.array(result)
    keys, counts = np.unique(result, return_counts=True)
    print('Keys: ', keys)
    print('Counts: ', counts)

    print(result)

    plt.bar(keys,counts)
    plt.show

输出：

Keys:  ['3-1-4-2' '3-2-3-2' '3-3-2-2' '4-2-2-2' '4-2-3-1' '4-4-2']
Counts:  [ 21 154  23   1  48   4]

编辑2：绘图显示在调试模式下，plt.show上有断点，当我跳过它消失时。因此它在运行模式下不可见。有什么建议吗？

Answer 1

np.unique可以返回列表中唯一元素的计数。

keys, counts = np.unique(x, return_counts=True)

然后你可以将它们绘制成条形图。

import matplotlib.pyplot as plt
import numpy as np

x = ['5-3-2', '5-3-2', '4-3-2', "2-3-2", '4-3-2', '4-3-2', "1-2-4"]
keys, counts = np.unique(x, return_counts=True)

plt.bar(keys, counts)
plt.show()

Answer 2

频率条形图基本上是直方图。幸运的是，matplotlib.pyplot内置了直方图方法！

假设您的列表名为x，您可以这样做：

import matplotlib.pyplot as plt
# the bins argument says how many bars (set to the number of unique values in your list)
# the rwidth argument is there so that the bars are not squished together
plt.hist(x, bins=len(set(x)), rwidth = 0.8)
plt.show()

这会为您提供列表中项目频率的直方图

基于字符串值列表的条形图

2 个答案: