从multiindex csv文件创建mulitindex pandas数据框

时间:2016-12-15 20:41:20

标签: python csv pandas dataframe multi-index

我正在尝试从现有的csv文件创建一个多索引pandas数据框(最终是一个csv文件)。我很难迭代数据框,因为它包含2个以上的维度。我该如何做到这一点?原始csv文件如下所示:

"Products" "Technologies" Region1 Region2 Region3
Prod1       Tech1         16      0       12
Prod2       Tech2         0       12      22
Prod3       Tech3         22      0       36

我正在寻找创建一个如下所示的csv文件:

"Technologies"  "Regions"   Prod1   Prod2   Prod3
Tech1           Region1     16      0       0
Tech1           Region2     0       0       0
Tech1           Region3     12      0       0
Tech2           Region1     0       0       0
Tech2           Region2     0       12      0
Tech2           Region3     0       22      0
Tech3           Region1     0       0       22
Tech3           Region2     0       0       0
Tech3           Region3     0       0       36

1 个答案:

答案 0 :(得分:0)

stackunstack是重塑数据框的有用函数,前者将列索引转换为行索引,后者则相反,也是check this tutorial

# transform regions to a separate column
(df.set_index(["Products", "Technologies"]).stack()

# rename the Region column and remove the name for Products column as it will be unstacked
   .rename_axis(("", "Technologies", "Regions"))

# unstack the product column to headers and reset the index to be columns
   .unstack(level=0, fill_value=0).reset_index())

enter image description here