Question

我正在读取CSV文件：

 Notation Level   RFResult   PRIResult   PDResult  Total Result
 AAA       1       1.23        0           2         3.23
 AAA       1       3.4         1           0         4.4
 BBB       2       0.26        1           1.42      2.68
 BBB       2       0.73        1           1.3       3.03
 CCC       3       0.30        0           2.73      3.03
 DDD       4       0.25        1           1.50      2.75
 AAA       5       0.25        1           1.50      2.75
 FFF       6       0.26        1           1.42      2.68
 ...
 ...

这是代码

import pandas as pd
import matplotlib.pyplot as plt

df = pd.rad_csv('home\NewFiles\Files.csv')
Notation = df['Notation']
Level = df['Level']
RFResult = df['RFResult']
PRIResult = df['PRIResult']
PDResult = df['PDResult']

fig, axes = plt.subplots(nrows=7, ncols=1)
ax1, ax2, ax3, ax4, ax5, ax6, ax7 = axes.flatten()
n_bins = 13
ax1.hist(data['Total'], n_bins, histtype='bar') #Current this shows all Total Results in one plot 
plt.show()

我想在每个不同的轴上显示每个“关卡总计结果”，如下所示：

ax1将显示1级总结果

ax2将显示2级总结果

ax3将显示3级总结果

ax4将显示4级总结果

ax5将显示5级总结果

ax6将显示6级总结果

ax7将显示7级总结果

Answer 1

您可以仅通过建立索引df[df['Level'] == level]['Total']来选择数据框的过滤部分。您可以使用for ax in axes.flatten()在轴之间循环。要获取索引，请使用for ind, ax in enumerate(axes.flatten())。请注意，Python通常从1开始计数，因此将1加到索引将是指示级别的好选择。

请注意，当字符串中包含反斜杠时，可以使用r字符串r'home\NewFiles\Files.csv'对其进行转义。

默认ylim是从0到最大条形高度，加上一些填充。可以分别为每个ax进行更改。在下面的示例中，使用ymax值列表来说明原理。

ax.grid(True, axis='both)设置该ax的网格。代替“两者”，也可以仅使用“ x”或“ y”设置该轴的网格。为每个刻度值绘制一条网格线。（下面的示例尝试使用很少的空间，因此只有几条网格线可见。）

import matplotlib.pyplot as plt
import pandas as pd
import numpy as np

N = 1000
df = pd.DataFrame({'Level': np.random.randint(1, 6, N), 'Total': np.random.uniform(1, 5, N)})

fig, axes = plt.subplots(nrows=5, ncols=1, sharex=True)
ymax_per_level = [27, 29, 28, 26, 27]
for ind, (ax, lev_ymax) in enumerate(zip(axes.flatten(), ymax_per_level)):
    level = ind + 1
    n_bins = 13
    ax.hist(df[df['Level'] == level]['Total'], bins=n_bins, histtype='bar')
    ax.set_ylabel(f'TL={level}') # to add the level in the ylabel
    ax.set_ylim(0, lev_ymax)
    ax.grid(True, axis='both')
plt.show()

PS：具有自定义图例和自定义垂直线的堆叠直方图可以创建为：

import matplotlib.pyplot as plt
from matplotlib.patches import Patch
import pandas as pd
import numpy as np

N = 1000
df = pd.DataFrame({'Level': np.random.randint(1, 6, N),
                   'RFResult': np.random.uniform(1, 5, N),
                   'PRIResult': np.random.uniform(1, 5, N),
                   'PDResult': np.random.uniform(1, 5, N)})
df['Total'] = df['RFResult'] + df['PRIResult'] + df['PDResult']

fig, axes = plt.subplots(nrows=5, ncols=1, sharex=True)
colors = ['crimson', 'limegreen', 'dodgerblue']
column_names = ['RFResult', 'PRIResult', 'PDResult']
level_vertical_line = [1, 2, 3, 4, 5]
for level, (ax, vertical_line) in enumerate(zip(axes.flatten(), level_vertical_line), start=1):
    n_bins = 13
    level_data = df[df['Level'] == level][column_names].to_numpy()
    # vertical_line = level_data.mean()
    ax.hist(level_data, bins=n_bins,
            histtype='bar', stacked=True, color=colors)
    ax.axvline(vertical_line, color='gold', ls=':', lw=2)
    ax.set_ylabel(f'TL={level}')  # to add the level in the ylabel
    ax.margins(x=0.01)
    ax.grid(True, axis='both')
legend_handles = [Patch(color=color) for color in colors]
axes[0].legend(legend_handles, column_names, ncol=len(column_names), loc='lower center', bbox_to_anchor=(0.5, 1.02))
plt.show()

在不同的轴上绘制直方图

1 个答案: