熊猫缩编持续时间

时间:2016-08-20 19:39:10

标签: python pandas

我正在尝试计算库存系列的下降持续时间和恢复时间。我可以计算下降,但我正在努力达到每次缩减的持续时间和恢复时间。到目前为止,我有这段代码:

import pandas as pd
import pickle
import xlrd
import numpy as np

np.random.seed(0)
df = pd.Series(np.random.randn(2500)*0.7+0.05, index=pd.date_range('1/1/2000', periods=2500, freq='D'))
df= 100*(1+df/100).cumprod()
df=pd.DataFrame(df)
df.columns = ['close']
df['ret'] = df.close/df.close[0]
df['modMax'] = df.ret.cummax()
df['modDD'] = 1-df.ret.div(df['modMax'])
groups = df.groupby(df['modMax'])
dd = groups['modMax','modDD'].apply(lambda g: g[g['modDD'] == g['modDD'].max()])
top10dd = dd.sort_values('modDD', ascending=False).head(10)
top10dd

这给出了该系列的10个最高下降,但我也想要缩短的持续时间和恢复时间。

1 个答案:

答案 0 :(得分:1)

我解决了以下问题:

def drawdown_group(df,index_list):
    group_max,dd_date = index_list
    ddGroup = df[df['modMax'] == group_max]
    group_length = len(ddGroup)
    group_dd = ddGroup['dd'].max()
    group_dd_length = len(ddGroup[ddGroup.index <= dd_date])
    group_start = ddGroup[0:1].index[0]
    group_end = ddGroup.tail(1).index[0]
    group_rec = group_length - group_dd_length
    #print (group_start,group_end,group_dd,dd_date,group_dd_length,group_rec,group_length)
    return group_start,group_end,group_max,group_dd,dd_date,group_dd_length,group_rec,group_length

dd_col = ('start','end','peak', 'dd','dd_date','dd_length','dd_rec','tot_length')
df_dd = pd.DataFrame(columns = dd_col)
for i in range(1,10):
    index_list = top10dd[i-1:i].index.tolist()[0]
    #print(index_list)
    start,end,peak,dd,dd_date,dd_length,dd_rec,tot_length = drawdown_group(df,index_list)
    #print(start,end,dd,dd_date,dd_length,dd_rec,tot_length)
    df_dd.loc[i-1] = drawdown_group(df,index_list)

制作此表格: enter image description here