Question

您好我想从数据库中获取一个表，但是包含字段名称，以便我可以在列标题中使用它们。我不一定事先知道所有字段名称的熊猫

所以，如果我的数据库看起来像

table test1

 a | b | c 
---+---+---
 1 | 2 | 3
 1 | 2 | 3
 1 | 2 | 3
 1 | 2 | 3
 1 | 2 | 3

我该怎么做

import psycopg2 as pq
cn = pq.connect('dbname=mydb user=me')
cr = cn.cursor()
cr.execute('SELECT * FROM test1;')
tmp = cr.fetchall()
tmp

这样tmp显示

[('a','b','c'),(1,2,3),(1,2,3),(1,2,3),(1,2,3),(1,2,3)]

由于

Answer 1

如果你想要的是一个数据帧，其中db表中的数据作为其值，而dataframe列名是你从db读入的字段名，那么这应该是你想做的：

import psycopg2 as pq
cn = pq.connect('dbname=mydb user=me')
cr = cn.cursor()
cr.execute('SELECT * FROM test1;')
tmp = cr.fetchall()

# Extract the column names
col_names = []
for elt in cr.description:
    col_names.append(elt[0])

# Create the dataframe, passing in the list of col_names extracted from the description
df = pd.DataFrame(tmp, columns=col_names)

Answer 2

列名称以cr.description[0][0]，cr.description[1][0]等格式提供。如果您想要的格式与您显示的格式完全相同，则需要做一些工作来提取它并将其粘贴在结果集。

Answer 3

你也可以将它映射到看起来更好一些的地方：

cursor.execute(open("blah.sql", "r").read())
data = cursor.fetchall()
cols = list(map(lambda x: x[0], cursor.description))
df = DataFrame(data, columns=cols)

Answer 4

您可以使用两种循环方式来不使用熊猫：

temp = []
for x in result:
    temp2 = {}
    c = 0
    for col in cursor.description:
        temp2.update({str(col[0]): x[c]})
        c = c+1
    temp.append(temp2)
print(temp)

这将打印以下内容：

[{'column1':'foo1','column2':'foo1'},{'column1':'foo2','column2':'foo2'},...]

希望对您有所帮助！干杯

Answer 5

import psycopg2 as pq
cn = pq.connect('dbname=mydb user=me')
cr = cn.cursor()
cr.execute('SELECT * FROM test1;')
tmp = cr.fetchall() #Hi, these are your codes that build a connection to a psql server

cols = []
for col in tmp.description:
    cols.append(col[0]) #Collect all column names into an empty list, cols    
tmp.insert(0, tuple(cols)) #insert elements by list.insert(index, new_item) method

输出为

[('a','b','c'),(1,2,3),(1,2,3),(1,2,3),(1,2,3),(1,2,3)]

Python psycopg2 postgres选择包括字段名称的列

5 个答案: