熊猫:创建一个以元组为键的字典

时间:2017-02-16 14:22:20

标签: python pandas dictionary tuples

鉴于此DataFrame

import pandas as pd
first=[0,1,2,3,4]
second=[10.2,5.7,7.4,17.1,86.11]
third=['a','b','c','d','e']
fourth=['z','zz','zzz','zzzz','zzzzz']
df=pd.DataFrame({'first':first,'second':second,'third':third,'fourth':fourth})
df=df[['first','second','third','fourth']]

   first  second third fourth
0      0   10.20     a      z
1      1    5.70     b     zz
2      2    7.40     c    zzz
3      3   17.10     d   zzzz
4      4   86.11     e  zzzzz

我可以创建一个包含列列表的字典作为值,如下所示:

d = {df.loc[idx, 'first']: [df.loc[idx, 'second'], df.loc[idx, 'third']] for idx in range(df.shape[0])}

但是如何创建一个字典,例如包含firstsecond作为键的元组?

结果将是:

In[1]:d
Out[1]: 
{(0,10.199999999999999): 'a',
 (1,5.7000000000000002): 'b',
 (2,7.4000000000000004): 'c',
 (3,17.100000000000001): 'd',
 (4,86.109999999999999): 'e'}

PS:我怎样才能确保pandas不会搞砸价值? 10.20现已成为10.1999999999 ......

1 个答案:

答案 0 :(得分:3)

您需要set_index创建MultiIndex,然后致电Series.to_dict

a = df.set_index(['first','second']).third.to_dict()
print (a)
{(2, 7.4): 'c', (1, 5.7): 'b', (3, 17.1): 'd', (0, 10.2): 'a', (4, 86.11): 'e'}