Converting a list of lists of tuples to a pandas DataFrame

Asked: 2017-06-06 14:54:27

Tags: python dataframe

I am trying to convert a list of lists of tuples to a pandas DataFrame, but I can't figure out how to do it. My addresses are structured as follows:

addresses = [
 [('the vicars inn', 'house'), ('68', 'house_number'), ('church lane', 'road'), ('arlesey', 'city'), ('beds', 'house')],
 [('the old oak', 'house'), ('85', 'house_number'), ('church lane', 'road'), ('arlesey', 'city'), ('beds', 'house')],
 [('adj', 'road'), ('85', 'house_number'), ('high street', 'road'), ('arlesey', 'city'), ('beds', 'house')],
 [('arlesey community centre', 'house'), ('high street', 'road'), ('arlesey', 'city'), ('beds', 'house')],
 [('arlesey community centre', 'house'), ('high street', 'road'), ('arlesey', 'city'), ('beds', 'house')]
]

Ideally, I need to end up with a DataFrame like this:

      city           house             house_number       road
0     arlesey        the vicars inn    68                 church lane
1     arlesey        the old oak       85                 church lane

What I have tried so far is pivoting the table, but it does not produce the expected result:

pd.DataFrame.from_records(addresses[0]).pivot(columns=1, values=0)

Does anyone have guidance on which approach I should look at to get the DataFrame I want?

Sam

1 Answer:

Answer 0: (score: 3)

You can convert each record to a dictionary keyed by label, and then use DataFrame.from_records:

import pandas as pd
pd.DataFrame.from_records([{k: v for v, k in row} for row in addresses])

#      city house   house_number    road
#0  arlesey beds              68    church lane
#1  arlesey beds              85    church lane
#2  arlesey beds              85    high street
#3  arlesey beds             NaN    high street
#4  arlesey beds             NaN    high street
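
Note that every row in the output above shows house as beds. Each address contains two tuples labelled 'house', and a dict comprehension keeps the last value for a repeated key. If the first value per label is the one wanted, as in the desired output in the question, one possible variation is sketched below. The helper name first_value_per_label is made up for this illustration, and only two of the question's rows are repeated to keep it short.

```python
import pandas as pd

# Two rows copied from the question's data, to keep the sketch short.
addresses = [
    [('the vicars inn', 'house'), ('68', 'house_number'),
     ('church lane', 'road'), ('arlesey', 'city'), ('beds', 'house')],
    [('arlesey community centre', 'house'), ('high street', 'road'),
     ('arlesey', 'city'), ('beds', 'house')],
]


def first_value_per_label(row):
    """Hypothetical helper: build {label: value}, keeping the first value per label."""
    record = {}
    for value, label in row:
        # setdefault only stores the value if the label is not present yet,
        # so a later ('beds', 'house') does not overwrite the earlier house name.
        record.setdefault(label, value)
    return record


df = pd.DataFrame.from_records([first_value_per_label(row) for row in addresses])
print(df)
```

Labels missing from a row (house_number in the second row here) still come out as NaN, the same as in the original answer. Whether the first or the last value is the right one to keep depends on what the repeated label is supposed to mean in your data.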