Backward fill with conditions

Time: 2017-09-12 11:21:39

Tags: python pandas dataframe

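I want to apply a backward fill to a specific column of a DataFrame under the following conditions: I have a column "colum_A" that can only take four values, called A, B, C and D, and the backward fill should work as follows: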

if the first non-NaN value is A, then backward fill with A;

if the first non-NaN value is B, then backward fill with B;

if the first non-NaN value is C, then backward fill with B;

if the first non-NaN value is D, then backward fill with C;

if colum_A contains only NaN, then fill with D

For example:

Input DF:

colum_A
 NaN
 NaN
 B
 B
 C
 C

Output DF:

colum_A
 B
 B
 C
 C
 D
 D

Please, any help would be greatly appreciated. Best regards, Carlo

1 Answer:

Answer 0: (score: 1)

I think you need map with bfill, applied conditionally:

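import numpy as np
import pandas as pd
# sample data, reconstructed from the colum_A values in the output printed below
df = pd.DataFrame({'colum_A': [np.nan, np.nan, 'B', 'A', np.nan, 'C', 'C', np.nan,
                               np.nan, np.nan, 'D', 'D', 'A', 'C', np.nan, 'A', np.nan]})
# get mask of the NaNs that need back filling
m = df['colum_A'].isnull()
# fill value to use for each possible first non-NaN value
d = {'A':'A','B':'B','C':'B','D':'C'}
# 'D' if all values are NaN, else use the mapped + back filled value only where colum_A is NaN
df['colum_B'] = 'D' if m.all() else np.where(m, df['colum_A'].map(d).bfill(), df['colum_A'])
# alternative
#df['colum_B'] = 'D' if m.all() else df['colum_A'].mask(m, df['colum_A'].map(d).bfill())
print (df)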
   colum_A colum_B
0      NaN       B
1      NaN       B
2        B       B
3        A       A
4      NaN       B
5        C       C
6        C       C
7      NaN       C
8      NaN       C
9      NaN       C
10       D       D
11       D       D
12       A       A
13       C       C
14     NaN       A
15       A       A
16     NaN     NaN
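
The same idea can be wrapped in a small reusable function. This is only a sketch of the approach above, not a definitive implementation: the names `FILL_MAP`, `conditional_bfill` and `all_nan_value` are made up for illustration, and the short sample column is an assumption, not data from the question.

```python
import numpy as np
import pandas as pd

# Fill value for a gap, keyed by the first non-NaN value that follows it
# (rules taken from the question; the name FILL_MAP is hypothetical).
FILL_MAP = {"A": "A", "B": "B", "C": "B", "D": "C"}


def conditional_bfill(s, fill_map=FILL_MAP, all_nan_value="D"):
    """Back fill NaNs in `s` with a value derived from the next non-NaN entry."""
    missing = s.isna()
    if missing.all():
        # No non-NaN value exists at all, so use the fallback everywhere.
        return pd.Series(all_nan_value, index=s.index, dtype=object)
    # Translate every value to its fill value, then propagate it backwards.
    candidates = s.map(fill_map).bfill()
    # Keep the original values; take the candidates only where `s` was NaN.
    return s.where(~missing, candidates)


# Small illustrative column (assumed data, not from the question).
df = pd.DataFrame({"colum_A": [np.nan, np.nan, "B", "A", np.nan, "C", "D", np.nan]})
df["colum_B"] = conditional_bfill(df["colum_A"])
print(df)
```

As in the output above, a NaN at the very end of the column has no following value to fill from, so it stays NaN. If those trailing gaps should be filled as well, that needs a separate rule, which the question does not specify.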