Question

I have a csv file containing the following data.

created_at               hr         bp_dias      bp_sys      weight     
18/05/2015  12:00:05     57
18/05/2015  12:00:05                 79
18/05/2015  12:00:05                               62
18/05/2015  12:00:05                                            83

I want to merge them all into a single row, like this:

 created_at              hr         bp_dias      bp_sys      weight 
 18/05/2015  12:00:05    57          79            62         83

I tried using pandas groupby to group the rows by timestamp, but that did not give me the result I wanted.

Here is the code I used.

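  import pandas as pd

  df = pd.read_csv('test.csv', parse_dates=True)
  # created_at holds day-first date strings, not epoch seconds, so unit='s' does not apply
  df['created_at'] = pd.to_datetime(df['created_at'], dayfirst=True)
  df = df.set_index('created_at')
  # groups by calendar day only, and no aggregation is applied yet
  df = df.groupby([df.index.year, df.index.month, df.index.day])

  print(df.head())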

Answer 1

You almost had it: group by 'created_at', call max(), then reset_index:

In [165]:    
df.groupby('created_at').max().reset_index()

Out[165]:
           created_at  hr  bp_dias  bp_sys  weight     
0 2015-05-18 12:00:05  57       79      62           83
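
For anyone who wants to try this without the original file, here is a minimal self-contained sketch. The DataFrame below is a hypothetical stand-in for test.csv, built on the assumption that each reading sits on its own row and the other columns are NaN; the variable names `ts` and `merged` are made up for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical in-memory stand-in for test.csv: one reading per row, NaN elsewhere.
ts = "18/05/2015 12:00:05"
df = pd.DataFrame({
    "created_at": [ts, ts, ts, ts],
    "hr": [57, np.nan, np.nan, np.nan],
    "bp_dias": [np.nan, 79, np.nan, np.nan],
    "bp_sys": [np.nan, np.nan, 62, np.nan],
    "weight": [np.nan, np.nan, np.nan, 83],
})

# The dates are day-first strings, so parse them with an explicit format.
df["created_at"] = pd.to_datetime(df["created_at"], format="%d/%m/%Y %H:%M:%S")

# max() skips NaN by default, so each column keeps its single real value
# and the four rows sharing a timestamp collapse into one.
merged = df.groupby("created_at").max().reset_index()
print(merged)
```

Because the columns contain NaN they are stored as floats, so the values may print as 57.0 rather than 57. If a timestamp could carry more than one real value per column, max() would keep only the largest; first() (first non-null entry per group) may be a better fit in that case.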

Grouping csv data under the same timestamp
