我有以下格式的csv文件,
,col1,col2,col3
row1,23,42,77
row2,25,39,87
row3,48,67,53
row4,14,48,66
我需要把它读成两个键的字典,这样
dict1['row1']['col2'] = 42
dict1['row4']['col3'] = 66
如果我尝试将csv.DictReader用于默认选项
with open(filePath, "rb" ) as theFile:
reader = csv.DictReader(theFile, delimiter=',')
for line in reader:
print line
我得到以下输出
{'': 'row1', 'col2': '42', 'col3': '77', 'col1': '23'}
{'': 'row2', 'col2': '39', 'col3': '87', 'col1': '25'}
{'': 'row3', 'col2': '67', 'col3': '53', 'col1': '48'}
{'': 'row4', 'col2': '48', 'col3': '66', 'col1': '14'}
我不确定如何处理此输出以创建我感兴趣的词典类型。
为了完整起见,如果您可以解决如何将字典写回具有上述格式的csv文件,那么它也会有所帮助
答案 0 :(得分:14)
使用CSV模块:
import csv
dict1 = {}
with open("test.csv", "rb") as infile:
reader = csv.reader(infile)
headers = next(reader)[1:]
for row in reader:
dict1[row[0]] = {key: int(value) for key, value in zip(headers, row[1:])}
答案 1 :(得分:6)
即使有点矫枉过正,你也可以使用pandas。专家认为,几乎没有任何代码可以获得预期的结果。
# Reading the file
df = pd.read_csv('tmp.csv', index_col=0)
# Creating the dict
d = df.transpose().to_dict(orient='series')
print(d['row1']['col2'])
42
答案 2 :(得分:1)
使用csv
模块解析输入文件的格式并不方便。我将分别解析标题,然后逐行解析其余部分,按,
拆分,一直剥离和制作字典。工作代码:
from pprint import pprint
d = {}
with open("myfile.csv") as f:
headers = [header.strip() for header in next(f).split(",")[1:]]
for line in f:
values = [value.strip() for value in line.split(",")]
d[values[0]] = dict(zip(headers, values[1:]))
pprint(d)
打印:
{'row1': {'col1': '23', 'col2': '42', 'col3': '77'},
'row2': {'col1': '25', 'col2': '39', 'col3': '87'},
'row3': {'col1': '48', 'col2': '67', 'col3': '53'},
'row4': {'col1': '14', 'col2': '48', 'col3': '66'}}