用上一行中的字符串替换列中的零(Python / Pandas)

时间:2018-07-24 17:17:10

标签: python pandas

我想用同一列上一行的字符串替换0。例如:谢菲尔德(Sheffield)下的0应该读为谢菲尔德(Sheffield)。我正在和熊猫一起工作。

file = file[['Branch', 'Type' ,'total']]
#replace NaN with 0 
file.fillna(0).tail(6)
Out[48]: 
     Branch                     Type  total

394   Sheffield  Sum of Resend to Branch      0
395           0   Number of PV Enquiries     83
396   Wakefield  Sum of Resend to Branch      0
397           0   Number of PV Enquiries     38
398        York  Sum of Resend to Branch      1
399           0   Number of PV Enquiries     59

I have tried:
a) #create a  series for that column and replace
branch = file.iloc[ :, 0]
branch.replace(0, branch(-1))
# why is this series not callable?

b)# I tried a loop in the dataframe
for item in file:
    if "Branch" == 0:
        replace(0, "Branch"[-1])
# I am unsure how to refer to the row above

1 个答案:

答案 0 :(得分:2)

replace与方法ffill一起使用

file_df['Branch'].replace(to_replace='0', method='ffill', inplace=True)

>>> file_df
        Branch                     Type  total
394  Sheffield  Sum of Resend to Branch      0
395  Sheffield   Number of PV Enquiries     83
396  Wakefield  Sum of Resend to Branch      0
397  Wakefield   Number of PV Enquiries     38
398       York  Sum of Resend to Branch      1
399       York   Number of PV Enquiries     59

或者,因为看起来您已经用NaN取代了0,所以您可以省略该步骤而只使用ffill,如果您的原始数据框看起来像:

>>> file_df
        Branch                     Type  total
394  Sheffield  Sum of Resend to Branch      0
395        NaN   Number of PV Enquiries     83
396  Wakefield  Sum of Resend to Branch      0
397        NaN   Number of PV Enquiries     38
398       York  Sum of Resend to Branch      1
399        NaN   Number of PV Enquiries     59

使用:

file_df['Branch'].ffill(inplace=True)

请注意,我调用数据框file_df而不是file是为了不掩盖内置的python