Question

这是我的Python问题：

我被要求生成一个输出表，其中包含每个变量中的Nan数（数据中有超过10个变量），min，max，mean，std，25％，50％和70％。我在熊猫中使用了describe函数创建了描述表，它给了我想要的一切但每个变量中的Nan数。我正在考虑将nan的数量作为新行添加到describe输出生成的输出中。

任何人都可以帮忙吗？

output = input_data.describe(include=[np.number]) # this gives the table output

count_nan = input_data.isnull().sum(axis=0) # this counts the number of Nan of each variable

如何将第二行作为一行添加到第一个表中？

Answer 1

您可以使用.append向DataFrame追加新行：

In [21]: output.append(pd.Series(count_nan, name='nans'))
Out[21]: 
              0         1         2         3         4
count  4.000000  4.000000  4.000000  4.000000  4.000000
mean   0.583707  0.578610  0.566523  0.480307  0.540259
std    0.142930  0.358793  0.309701  0.097326  0.277490
min    0.450488  0.123328  0.151346  0.381263  0.226411
25%    0.519591  0.406628  0.478343  0.406436  0.429003
50%    0.549012  0.610845  0.607350  0.478787  0.516508
75%    0.613127  0.782827  0.695530  0.552658  0.627764
max    0.786316  0.969421  0.900046  0.582391  0.901610
nans   0.000000  0.000000  0.000000  0.000000  0.000000

如何在Panda的输出中添加一个新行描述python中的函数输出

1 个答案: