Question

我正致力于通过散景自动绘制代谢组学数据的绘图功能。目前，我正在尝试从CSV读取数据框并迭代生成每个代谢物（列）的框图。

我有一个示例df，如下所示：

Sample  Group   AMP     ADP     ATP
1A      A       239847  239084  987374
1B      A       245098  241210  988950
2A      B       238759  200554  921032
2B      B       230029  215408  89980

以下是我的代码：

import pandas
from bokeh.plotting import figure, output_file, show, save
from bokeh.charts import BoxPlot

df = pandas.read_csv("testdata_2.csv")

for colname, col in df.iteritems():
p = BoxPlot(df, values=df[colname], label='Group', xlabel='Group', ylabel='Peak Area',
             title=colname)
    output_file("boxplot.html")
    show(p)

这会产生错误：

raise ValueError("expected an element of either %s, got %r" % (nice_join(self.type_params), value))

ValueError: expected an element of either Column Name or Column String or List(Column Name or Column String

似乎设置values=df[colname]是个问题。如果我用values=df['colname']替换它，它会给我一个colname的关键错误。如果我指定一个给定的列，例如values='ATP'，我可以很好地绘制，但我需要能够遍历所有列。

任何指导？这甚至是最好的方法吗？

提前致谢。

Answer 1

如果你想水平组织它们，你可以创建不同的图形，然后你可以使用例如bokeh.io中的hplot，如下所示：

import pandas
from bokeh.plotting import figure, output_file, show, save
from bokeh.charts import BoxPlot
from bokeh.io import hplot

df = pandas.read_csv("testdata_2.csv")

p = []
for colname in ['AMP','ADP','ATP']:
    p += [BoxPlot(df, values=colname, label='Group', xlabel='Group',
            ylabel='Peak Area',title=colname, width=250,height=250)]

output_file("boxplot.html")
show(hplot(*p))

对于您的具体示例，我得到：

将列数据从pandas df传递到散景图功能

1 个答案: