pandas转换dataframe数据透视表

时间:2014-10-31 04:23:19

标签: python pandas pivot

我可以转换以下数据框:

   VALUE       COUNT  RECL_LCC  RECL_PI
0      1  15,686,114         3        1
1      2  27,537,963         1        1
2      3  23,448,904         1        2
3      4   1,213,184         1        3
4      5  14,185,448         3        2
5      6  13,064,600         3        3
6      7  27,043,180         2        2
7      8  11,732,405         2        1
8      9  14,773,871         2        3

这样的事情:

RECL_PI            1           2           3
RECL_LCC                                    
1         27,537,963  23,448,904   1,213,184
2         11,732,405  27,043,180  14,773,871
3         15,686,114  14,185,448  13,064,600

使用pandas pivot table:

plot_table = LCC_PI_df.pivot_table(index=['RECL_LCC'], columns='RECL_PI', values='COUNT', aggfunc='sum')

是否有一种快速方法来创建数据透视表,其中包含行总数的百分比而非原始总数?

1 个答案:

答案 0 :(得分:3)

根据评论,我认为你可以这样做,如下。请注意,我将COUNT列转换为整数来执行此操作:

#convert strings of the COUNT column to integers
import locale
locale.setlocale( locale.LC_ALL, 'en_US.UTF-8' ) 
LCC_PI_df.COUNT = LCC_PI_df.COUNT.apply(locale.atoi)

plot_table = LCC_PI_df.pivot_table(index=['RECL_LCC'], columns='RECL_PI', values='COUNT', aggfunc='sum')
#Calculate percentages
plot_table = plot_table.apply(lambda x : x / x.sum(), axis=1)