我编写了一个程序,我添加了两列并将答案写入CSV文件,但是当我只想编写选择的列时,我收到错误。 这是我的逻辑:
import pandas as pd
df = pd.DataFrame({'A' : ['foo', 'bar', 'foo', 'bar',
'foo', 'bar', 'foo', 'bar'],
'B' : ['one', 'one', 'two', 'two',
'two', 'two', 'one', 'two'],
'C' : [56, 2, 3, 4, 5, 6, 0, 2],
'D' : [51, 2, 3, 4, 5, 6, 0, 2]})
grouped = df.groupby(['A', 'B']).sum()
grouped['sum'] = (grouped['C'] / grouped['D'])
# print (grouped[['sum']])
a = pd.DataFrame(grouped)
a.to_csv("C:\\Users\\test\\Desktop\\test.csv", index=False, cols=('A','B','sum'))
我怎么才能写出A,B和Sum列的数据。 我收到以下错误
Traceback (most recent call last):
File "C:\Users\test\Desktop\eclipse\yuy\group.py", line 19, in <module>
a.to_csv("C:\\Users\\test\\Desktop\\test.csv", index=False, cols=('A','B','sum'))
File "C:\Python27\lib\site-packages\pandas\core\frame.py", line 1126, in to_csv
date_format=date_format)
File "C:\Python27\lib\site-packages\pandas\core\format.py", line 992, in __init__
self.obj = self.obj.loc[:, cols]
File "C:\Python27\lib\site-packages\pandas\core\indexing.py", line 1018, in __getitem__
return self._getitem_tuple(key)
File "C:\Python27\lib\site-packages\pandas\core\indexing.py", line 595, in _getitem_tuple
self._has_valid_tuple(tup)
File "C:\Python27\lib\site-packages\pandas\core\indexing.py", line 106, in _has_valid_tuple
if not self._has_valid_type(k, i):
File "C:\Python27\lib\site-packages\pandas\core\indexing.py", line 1100, in _has_valid_type
(key, self.obj._get_axis_name(axis)))
KeyError: "[['A', 'B', 'sum']] are not in ALL in the [columns]"
答案 0 :(得分:12)
A和B不再是列,因为您调用了groupby(['A', 'B'])
。相反,他们都是一个索引。尝试省略index=False
,如下所示:
a.to_csv("test.csv", cols=['sum'])
答案 1 :(得分:1)
如果要将其写为excel文件,请使用此命令
writer = pd.ExcelWriter('output.xlsx')
data_frame.to_excel(writer,'Sheet1')
writer.save()