sklearn confusion_matrix在错误的位置显示错误的尺寸/刻度线

时间:2019-11-29 20:51:26

标签: python scikit-learn confusion-matrix

我正试图显示一个混乱矩阵,但我无法弄清楚为什么它拒绝以适当的方式显示。这是我的代码:

import numpy as np
import itertools
from sklearn.metrics import confusion_matrix

def plot_confusion_matrix(cm, classes,
                          normalize=False,
                          title='Confusion matrix',
                          cmap=plt.cm.winter):
    if normalize:
        cm = cm.astype('float') / cm.sum(axis=1)[:, np.newaxis]
    plt.imshow(cm, interpolation='nearest', cmap=cmap)
    plt.title(title, fontsize=30)
    plt.colorbar()
    tick_marks = np.arange(len(classes))
    plt.xticks(tick_marks, classes, fontsize=20)
    plt.yticks(tick_marks, classes, fontsize=20)

    fmt = '.2f' if normalize else 'd'
    thresh = cm.max() / 2.

    for i, j in itertools.product(range(cm.shape[0]), range(cm.shape[1])):
        plt.text(j, i, format(cm[i, j], fmt), horizontalalignment="center", 
                 color="white" if cm[i, j] < thresh else "black", fontsize=40)

    plt.tight_layout()
    plt.ylabel('True label', fontsize=30)
    plt.xlabel('Predicted label', fontsize=30)

    return plt

cm = confusion_matrix(y_test, y_predicted_counts)
fig = plt.figure(figsize=(10, 10))
plot = plot_confusion_matrix(cm, classes=['Unsure','No','Yes'], normalize=False, title='Confusion matrix')
plt.show()
print(cm)

这是显示的内容:

bad confusion matrix

任何帮助将不胜感激。预先感谢。

3 个答案:

答案 0 :(得分:0)

对于imshow的调用,您需要指定origin='lower'(默认值为'upper';他们可能会在某个时间更改此设置,并且scikit-learn文档未更新其{ {3}})。因此,以下方法可以解决问题:

plt.imshow(cm, interpolation='nearest', cmap=cmap, origin='lower')
#                                                    ^
#                                                    |
# added origin='lower'  ------------------------------

答案 1 :(得分:0)

使用Matplotlib

如果要保留matplotlib实现,只需在plot_confusion_matrix函数的末尾添加plt.ylim(-0.5,2.5)

def plot_confusion_matrix(cm, classes,
                          normalize=False,
                          title='Confusion matrix',
                          cmap=plt.cm.winter):
    if normalize:
        cm = cm.astype('float') / cm.sum(axis=1)[:, np.newaxis]
    plt.imshow(cm, interpolation='nearest', cmap=cmap)
    plt.title(title, fontsize=30)
    plt.colorbar()
    tick_marks = np.arange(len(classes))
    plt.xticks(tick_marks, classes, fontsize=20)
    plt.yticks(tick_marks, classes, fontsize=20)

    fmt = '.2f' if normalize else 'd'
    thresh = cm.max() / 2.

    for i, j in itertools.product(range(cm.shape[0]), range(cm.shape[1])):
        plt.text(j, i, format(cm[i, j], fmt), horizontalalignment="center", 
                 color="white" if cm[i, j] < thresh else "black", fontsize=40)

    plt.tight_layout()
    plt.ylabel('True label', fontsize=30)
    plt.xlabel('Predicted label', fontsize=30)
    plt.ylim(-0.5, 2.5)  # <-- SOLUTION 

    return plt

使用Seaborn

您可以尝试使用seaborn软件包来绘制热图:

from sklearn.metrics import confusion_matrix
import pandas as pd
import seaborn as sn
import matplotlib.pyplot as plt   

def plot_confusion_matrix(cm, classes,
                          normalize=False,
                          title='Confusion matrix',
                          cmap=plt.cm.winter):
  cm_df = pd.DataFrame(cm, columns=classes, index = classes)
  cm_df.index.name = 'Actual'
  cm_df.columns.name = 'Predicted'
  plt.figure(figsize = (10,7))
  sn.set(font_scale=1.4)#for label size
  ax =sn.heatmap(cm_df, cmap=cmap, annot=True,annot_kws={"size": 16},fmt="d")# font size
  plt.title(title)
  bottom, top = ax.get_ylim()
  ax.set_ylim(bottom + 0.5, top - 0.5)
  plt.show()

plot_confusion_matrix(cm, classes=['Unsure','No','Yes'], normalize=False, title='Confusion matrix')

Confusion Matrix Result

希望这对您有用!

答案 2 :(得分:0)

您可能正在使用matplotlib 3.1.1,它打破了tick默认行为。升级到3.1.2或降级到3.1.0即可解决此问题。