Pandas resample函数不能在DateTimeIndex上工作

时间:2015-09-14 21:09:41

标签: python pandas

我有一个名为aus的数据框,如下所示:

+--------------+-------------+
|              | link clicks |
+--------------+-------------+
| created_time |             |
| 2015-07-20   |        8600 |
| 2015-07-21   |       11567 |
| 2015-07-22   |        1809 |
| 2015-07-23   |        7032 |
| 2015-07-26   |       23704 |
+--------------+-------------+

我将索引设为DateTimeIndex,如下所示: aus.index = pd.to_datetime(aus.index)

然后我运行这样的支票:type(aus.index),给定的输出为pandas.tseries.index.DatetimeIndex

然后当我尝试将索引重新采样为数周时 aus.index = aus.resample('w', how='sum', axis=1)我遇到以下错误:

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-310-3268a3f46a19> in <module>()
----> 1 aus.index = aus.resample('w', how='sum', axis=1)

/usr/local/lib/python2.7/site-packages/pandas/core/generic.pyc in resample(self, rule, how, axis, fill_method, closed, label, convention, kind, loffset, limit, base)
   3264                               fill_method=fill_method, convention=convention,
   3265                               limit=limit, base=base)
-> 3266         return sampler.resample(self).__finalize__(self)
   3267 
   3268     def first(self, offset):

/usr/local/lib/python2.7/site-packages/pandas/tseries/resample.pyc in resample(self, obj)
    100             return self.obj
    101         else:  # pragma: no cover
--> 102             raise TypeError('Only valid with DatetimeIndex, TimedeltaIndex or PeriodIndex')
    103 
    104         rs_axis = rs._get_axis(self.axis)

TypeError: Only valid with DatetimeIndex, TimedeltaIndex or PeriodIndex

我之前的类型检查说我有正确的索引,但重采样功能并不这么认为。 有什么想法吗?

1 个答案:

答案 0 :(得分:3)

axis = 1表示它正在尝试重新采样列(不是DatetimeIndex)。

In [11]: df.columns
Out[11]: Index([u'link clicks'], dtype='object')

In [12]: type(df.columns)
Out[12]: pandas.core.index.Index

使用axis = 0:

In [21]: aus.resample('w', how='sum', axis=0)
Out[21]:
              link clicks
created_time
2015-07-26          52712

注意:这是resample的默认值:

In [22]: aus.resample('w', how='sum')
Out[22]:
              link clicks
created_time
2015-07-26          52712