数据框重采样总和

时间:2020-06-16 11:11:21

标签: dataframe

我正尝试将我的数据帧从15分钟重新采样到1小时并求和该特定小时的值:

df['DateTime'] = df["Data"] +" "+ df["Hora"]
df['DateTime'] = df['DateTime'].str.replace('24:00', '00:00')
df['DateTime'] = pd.to_datetime(df.DateTime)
df=df.set_index('DateTime')

df=df.drop(columns=['Data','Hora'])

res=df.resample('60min').sum()

尽管如此,结果还是不正确的:

df:

2019-01-01 00:15:00,6.0,3.0,0.0
2019-01-01 00:30:00,6.0,3.0,0.0
2019-01-01 00:45:00,6.0,4.0,0.0
2019-01-01 01:00:00,8.0,3.0,0.0
2019-01-01 01:15:00,10.0,3.0,0.0
2019-01-01 01:30:00,6.0,3.0,0.0
2019-01-01 01:45:00,5.0,0.0,0.0
2019-01-01 02:00:00,5.0,0.0,0.0
2019-01-01 02:15:00,5.0,1.0,0.0
2019-01-01 02:30:00,5.0,0.0,0.0
2019-01-01 02:45:00,6.0,2.0,0.0
2019-01-01 03:00:00,5.0,1.0,0.0

res:

2019-01-01 00:00:00,23.0,11.0,0.0 -应该是:(6 + 6 + 6 + 8 = 26,3 + 3 + 4 + 3 = 13) 2019-01-01 01:00:00,29.0,9.0,0.0 -应该是:(26,6) 2019-01-01 02:00:00,21.0,3.0,0.0 -应该是:(5 + 5 + 6 + 5 = 21,1 + 0 + 2 + 1 = 4)

我这样做

res=df.resample('60min').min()

他给我显示的是5和1,而不是第一个小时的6和3!

2019-01-01 00:00:00,**5.0,1**.0,0.0

使用.max()

2019-01-01 00:00:00,6.0, 4 .0,0.0

4个错误...

0 个答案:

没有答案