在熊猫中循环查询

时间:2019-09-19 20:13:29

标签: python pandas

你好,我有下面的代码

june = q.query('20190531 < ndate < 20190701').copy()
ytd = q.query('20170101 < ndate < 20190531').copy()
aux = june.merge(ytd, on ='buyer-email')
june['date_diffsku'] = june['buyer-email'].map(aux[aux['target_product_x']!=aux['target_product_y']].groupby('buyer-email').ndate_y.max()).fillna(0)
samei = ytd.groupby(['buyer-email','target_product','nCustomer Paid'],as_index=False).agg({'ndate':'min'})
njune = pd.merge(june,samei[['buyer-email','target_product','ndate','nCustomer Paid']], left_on = ['buyer-email','target_product'], right_on=['buyer-email','target_product'],how ='left')
njune['ndate_y'] = njune['ndate_y'].fillna(0)
njune = njune[njune['target_product']=='a']
njune.to_csv('path/06_19.csv')

我需要迭代查询,以便月份在双方两边都减去1,这样下一次就可以了

june = q.query('20190431 < ndate < 20190601').copy()
ytd = q.query('20170101 < ndate < 20190431').copy()
...
njune.to_csv('path/05_19.csv')

,然后是其余的代码。目的是在达到

值时停止
june = q.query('20170131 < ndate < 20170301').copy()
    ytd = q.query('20170101 < ndate < 20170131').copy()
    ...
    njune.to_csv('path/02_17.csv')

有什么主意我可以循环吗?

1 个答案:

答案 0 :(得分:0)

不确定这是否是最好的,但是我最终创建了3个列表

fa = ['20190531 < ndate < 20190701','20190431 < ndate < 20190601']
af = ['20170101 < ndate < 20190531','20170101 < ndate < 20190431']
bf = [62019,52019]
for a,b,c in zip(fa,af,bf) :

并添加了其余部分-希望有人可以添加更好的方法!