Question

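I want to convert a list of tuples so that the first element of each tuple names one of two columns, and the second element supplies the value for that column in a pandas DataFrame.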

My current list of tuples:

list_tuples = [('G', 9.8), ('B', 4.2), ('G', 9.6), ('B', 2.3), ('G',7.6), ('B', 3.1)]

Desired output:

            G        B   
           9.8      4.2      
           9.6      2.3      
           7.6      3.1

The code I currently have does not produce the desired output:

df = pd.DataFrame(list_tuples, columns=['G', 'B'])

Answer 1

Use a defaultdict to convert the list of tuples into a dictionary of lists, then pass it to the DataFrame constructor:

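from collections import defaultdict

import pandas as pd

# Collect the values for each label in order of appearance
d = defaultdict(list)
for a, b in list_tuples:
    d[a].append(b)

# Each key becomes a column, each list its values
df = pd.DataFrame(d)
print(df)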
     G    B
0  9.8  4.2
1  9.6  2.3
2  7.6  3.1
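The loop above assumes every label occurs the same number of times; if the lists differ in length, the DataFrame constructor raises a ValueError. A minimal sketch of one way to tolerate that, wrapping each list in a pd.Series so that shorter columns are padded with NaN. The helper name tuples_to_frame and the extra ('G', 5.0) tuple are hypothetical and not part of the original answer:

```python
from collections import defaultdict

import pandas as pd


def tuples_to_frame(pairs):
    """Group values by label, then build one column per label."""
    grouped = defaultdict(list)
    for label, value in pairs:
        grouped[label].append(value)
    # Wrapping each list in a Series lets pandas align on the index,
    # so a shorter column is padded with NaN instead of raising.
    return pd.DataFrame({label: pd.Series(values) for label, values in grouped.items()})


list_tuples = [('G', 9.8), ('B', 4.2), ('G', 9.6), ('B', 2.3), ('G', 7.6), ('B', 3.1)]
df = tuples_to_frame(list_tuples)

# Hypothetical uneven input: one extra 'G' value with no matching 'B'
uneven = tuples_to_frame(list_tuples + [('G', 5.0)])
```

For the balanced input from the question this should give the same frame as the plain dictionary of lists; for the uneven input the last row of B is NaN.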

Answer 2

Convert each tuple to a single-key dictionary, build a DataFrame from the resulting list, and clean it up with DataFrame.bfill and DataFrame.drop_duplicates:

import pandas as pd

list_tuples = [('G', 9.8), ('B', 4.2), ('G', 9.6), ('B', 2.3), ('G',7.6), ('B', 3.1)]

df = (pd.DataFrame([{col1:val} for col1, val in list_tuples])
        .bfill()
        .drop_duplicates('B')
        .reset_index(drop=True)
     )

     G    B
0 9.80 4.20
1 9.60 2.30
2 7.60 3.10
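Note that the bfill / drop_duplicates chain relies on the values in B being distinct: if the same B value appeared twice, drop_duplicates('B') would discard a legitimate row. A sketch of an alternative that does not depend on the values, numbering the occurrences of each label with groupby(...).cumcount() and then pivoting. This is a different technique from the answer above, and the names long_df, wide_df, 'key', 'value' and 'row' are chosen only for illustration:

```python
import pandas as pd

list_tuples = [('G', 9.8), ('B', 4.2), ('G', 9.6), ('B', 2.3), ('G', 7.6), ('B', 3.1)]

# Long format: one row per tuple
long_df = pd.DataFrame(list_tuples, columns=['key', 'value'])

# Number the occurrences of each label: 0, 1, 2, ... within 'G' and within 'B'
long_df['row'] = long_df.groupby('key').cumcount()

# One column per label, one row per occurrence number
wide_df = long_df.pivot(index='row', columns='key', values='value')

# pivot sorts the columns; restore the order of first appearance
wide_df = wide_df[list(long_df['key'].unique())]

# Drop the 'row' / 'key' axis names so the frame prints like the desired output
wide_df = wide_df.rename_axis(index=None, columns=None)
```

Because rows are matched by occurrence number rather than by value, repeated values in either column are kept.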
