在箱线图中叠加中值/方差的数值

时间:2013-09-17 22:42:20

标签: python matplotlib boxplot

在Python中使用箱形图时,有没有办法自动/轻松覆盖中位数和&的值。每个盒子顶部的方差(或至少是中位数的数值)?

E.g。在下面的箱子图中,我想在每个箱子图上叠加文本(中位数,+ - 标准)。

enter image description here

2 个答案:

答案 0 :(得分:17)

假设您使用boxplot函数绘制箱线图,它会返回一个包含图表组件的字典。请注意,该框表示内部四分位数范围(25至75百分位数),而不是标准差。

>>> bp_dict = boxplot(data, vert=False) # draw horizontal boxplot
>>> bp_dict.keys()
>>> bp_dict.keys()
['medians', 'fliers', 'whiskers', 'boxes', 'caps']

这些包含构成每个绘图元素的Line2D对象。您可以使用Line2D.get_xydata方法获取中位数和框位置(在数据坐标中)以确定文本的位置。

from pylab import *

# from http://matplotlib.org/examples/pylab_examples/boxplot_demo.html

# fake up some data
spread= rand(50) * 100
center = ones(25) * 50
flier_high = rand(10) * 100 + 100
flier_low = rand(10) * -100
data =concatenate((spread, center, flier_high, flier_low), 0)

# fake up some more data
spread= rand(50) * 100
center = ones(25) * 40
flier_high = rand(10) * 100 + 100
flier_low = rand(10) * -100
d2 = concatenate( (spread, center, flier_high, flier_low), 0 )
data.shape = (-1, 1)
d2.shape = (-1, 1)
#data = concatenate( (data, d2), 1 )
# Making a 2-D array only works if all the columns are the
# same length.  If they are not, then use a list instead.
# This is actually more efficient because boxplot converts
# a 2-D array into a list of vectors internally anyway.
data = [data, d2, d2[::2,0]]

# multiple box plots on one figure
figure()

# get dictionary returned from boxplot
bp_dict = boxplot(data, vert=False)

for line in bp_dict['medians']:
    # get position data for median line
    x, y = line.get_xydata()[1] # top of median line
    # overlay median value
    text(x, y, '%.1f' % x,
         horizontalalignment='center') # draw above, centered

for line in bp_dict['boxes']:
    x, y = line.get_xydata()[0] # bottom of left line
    text(x,y, '%.1f' % x,
         horizontalalignment='center', # centered
         verticalalignment='top')      # below
    x, y = line.get_xydata()[3] # bottom of right line
    text(x,y, '%.1f' % x,
         horizontalalignment='center', # centered
             verticalalignment='top')      # below

show()

boxplot output

答案 1 :(得分:-1)

稍微修正:

for line in bp_dict['medians']:
  # get position data for median line
  x, y = line.get_xydata()[1]  # top of median line
  # overlay median value
  text(x, y, '%.1f' % x, horizontalalignment='center')  # draw above, centered

for box in bp_dict['boxes']:
  x, y = box.get_path().vertices[0]  # bottom of left line
  text(x, y, '%.1f' % x, horizontalalignment='center',  # centered
  verticalalignment='top')      # below
  x, y = box.get_path().vertices[6]  # bottom of right line
  text(x, y, '%.1f' % x,
    horizontalalignment='center',  # centered
    verticalalignment='top')      # below