我在酒店预订了大熊猫数据框。每行都是预订,如下所示:
Name Arrival Departure RoomNights
Trent Cotchin 29/10/2017 2/11/2017 4
Dustin Martin 1/11/2017 4/11/2017 3
Alex Rance 2/11/2017 3/11/2017 1
我想使用python进行转换,以便每行成为一个午夜。输出看起来像这样:
Name Arrival Departure RoomNights RoomNight Date
Trent Cotchin 29/10/2017 2/11/2017 4 29/10/2017
Trent Cotchin 29/10/2017 2/11/2017 4 30/10/2017
Trent Cotchin 29/10/2017 2/11/2017 4 31/10/2017
Trent Cotchin 29/10/2017 2/11/2017 4 1/11/2017
Dustin Martin 1/11/2017 4/11/2017 3 1/11/2017
Dustin Martin 1/11/2017 4/11/2017 3 2/11/2017
Dustin Martin 1/11/2017 4/11/2017 3 3/11/2017
Alex Rance 2/11/2017 3/11/2017 1 2/11/2017
这使我可以轻松总计任何给定日/月的房间总数。
答案 0 :(得分:11)
使用:
#convert columns to datetime
df['Arrival'] = pd.to_datetime(df['Arrival'])
df['Departure'] = pd.to_datetime(df['Departure'])
#repeat rows
df = df.loc[df.index.repeat(df['RoomNights'])]
#group by index with transform for date ranges
df['RoomNight Date'] =(df.groupby(level=0)['Arrival']
.transform(lambda x: pd.date_range(start=x.iat[0], periods=len(x))))
#unique default index
df = df.reset_index(drop=True)
print (df)
Name Arrival Departure RoomNights RoomNight Date
0 Trent Cotchin 2017-10-29 2017-11-02 4 2017-10-29
1 Trent Cotchin 2017-10-29 2017-11-02 4 2017-10-30
2 Trent Cotchin 2017-10-29 2017-11-02 4 2017-10-31
3 Trent Cotchin 2017-10-29 2017-11-02 4 2017-11-01
4 Dustin Martin 2017-11-01 2017-11-04 3 2017-11-01
5 Dustin Martin 2017-11-01 2017-11-04 3 2017-11-02
6 Dustin Martin 2017-11-01 2017-11-04 3 2017-11-03
7 Alex Rance 2017-11-02 2017-11-03 1 2017-11-02