Question

我正在使用Pyomo，并且尝试输入一些参数的4-D数据。

我在Excel电子表格中保存了如下数据：

可以在此处找到原始数据的链接：

我想用Python导入数据，并将元组中的每个列索引和标头值作为字典的键，并将值作为字典的值。

基本上，预期输出应如下所示：

p = {('Heat', 'Site 1', 1, 1): 14,
     ('Heat', 'Site 1', 1, 2): 16,
     ('Heat', 'Site 1', 1, 3): 10,
     ('Heat', 'Site 1', 2, 1): 13,
     ('Heat', 'Site 1', 2, 2): 13,
     ('Heat', 'Site 1', 2, 3): 13,
     ('Cool', 'Site 1', 1, 1): 5,
     ('Heat', 'Site 1', 1, 2): 6,
...
     ('Elec', 'Site 2', 2, 1): 11,
     ('Elec', 'Site 2', 2, 2): 15,
     ('Elec', 'Site 2', 2, 3): 15}

我的想法是先使用熊猫导入excel文件，然后再使用to_dict方法。

我的工作如下：

import pandas as pd
Loads = pd.read_excel("Time_series_parameters.xlsx", index_col=[0,1], header = [0,1])

这很好用，我能够获得一个带有两个索引列和两个标题行的数据框：

       Heat   Cool   Elec   Heat   Cool   Elec
Time Site 1 Site 1 Site 1 Site 2 Site 2 Site 2
1 1      14      5     13     10     20     14
  2      16      6     11     10     14     10
  3      10      7     14     11     18     11
2 1      13      8     14     20     19     11
  2      13      7     11     14     15     15
  3      13      6     13     12     19     15

但是，我从那里尝试达到预期结果的任何操作都失败了……to_dict方法中的所有设置都没有给我预期的结果。

因此，如果有人可以在这里提供帮助，我将不胜感激。

Answer 1

我对此的解决方案是：

import pandas as pd
Loads = pd.read_excel("Time_series_parameters.xlsx", index_col=[0, 1], header=[0, 1])

out = {}
for index, inner in Loads.iteritems():
    for sec_index, value in inner.iteritems():
        out[index[0], index[1], sec_index[0], sec_index[1]] = value

结果输出为：

{('Heat', 'Site 1', 1, 1): 14,
 ('Cool', 'Site 1', 1, 1): 5,
 ('Elec', 'Site 1', 1, 1): 13,
 ('Heat', 'Site 2', 1, 1): 10,
 ('Cool', 'Site 2', 1, 1): 20,
 ('Elec', 'Site 2', 1, 1): 14,
 ('Heat', 'Site 1', 1, 2): 16,
 ('Cool', 'Site 1', 1, 2): 6,
 ('Elec', 'Site 1', 1, 2): 11,
 ('Heat', 'Site 2', 1, 2): 10,
 ...

Answer 2

我还找到了另一个答案，该答案实际上使用其他pandas功能可以达到相同的结果。该代码可以在下面看到：

Loads = pd.read_excel("Time_series_parameters.xlsx", sheet_name = "Loads", index_col=[0,1], header=[0, 1])
Loads = Loads.stack().stack()
Loads = Loads.reorder_levels([3,2,0,1])
p = Loads.to_dict()

输出再次如下所示：

{('Cool', 'Site 1', 1, 1): 18,
 ('Elec', 'Site 1', 1, 1): 18,
 ('Heat', 'Site 1', 1, 1): 19,
 ('Cool', 'Site 2', 1, 1): 17,
...

通过熊猫将具有多个列索引和标题行的Excel放入Python字典

2 个答案: