我有一个包含4列的数据框,基本上我正在尝试创建另一列并使用if语句返回满足条件的值
If NR/HL1 is not equal to 0, then outputColumn(NR/HL) = NR/HL1
if NR/HL1 is equals to 0, then outputColumn(NR/HL) = NR/HL2
if NR/HL1 is equals to 0 and NR/HL2 is equal to 0, then outputColumn(NR/HL) = NR/HL3
SKU NR/HL1 NR/HL2 NR/HL3 OutputColumn(NR/HL)
123 10 20 0 10
456 0 30 20 30
567 0 0 40 40
890 10 20 50 10
我使用了以下代码,它可以正常运行,但不能100%准确。总是错过一个或另一个条件。如果检查输出条件3的图像是否满足,但它将返回默认值。 NR/HL3 !=0, But still NR/HL ==0
def f(AC_off_trade):
if AC_off_trade['NR/HL1'] != 0:
return AC_off_trade['NR/HL1']
if AC_off_trade['NR/HL1'] == 0:
val = AC_off_trade['NR/HL2']
if AC_off_trade['NR/HL1'] == 0 and AC_off_trade['NR/HL2'] == 0:
return AC_off_trade['NR/HL3']
else:
return 0
AC_off_trade['NR/HL'] = AC_off_trade.apply(f,axis=1)
更新的代码
#defining condition
hl1_equal_0_condition = AC_off_trade["NR/HL1"]==0.0
hl2_equal_0_contition = AC_off_trade["NR/HL2"]==0.0
#default value
AC_off_trade.loc[:,"NR/HL"]=0
#setting values depending on condition
AC_off_trade.loc[~hl1_equal_0_condition, "NR/HL"] = AC_off_trade["NR/HL1"]
AC_off_trade.loc[hl1_equal_0_condition, "NR/HL"] = AC_off_trade["NR/HL2"]
AC_off_trade.loc[hl1_equal_0_condition & hl2_equal_0_contition, "NR/HL"] = AC_off_trade["NR/HL3"]
答案 0 :(得分:0)
首先,您应该尝试避免使用apply
来进行矢量化,因为性能会更好。然后,您可以使用条件以获得所需的输出:
df = pd.DataFrame({"SKU": [123, 456, 567, 890], "NR/HL1" : [10, 0, 0, 10], "NR/HL2": [20, 30, 0, 20],"NR/HL3": [0, 20, 40, 50]})
# Defining conditions
hl1_equal_0_condition = df["NR/HL1"]==0
hl2_equal_0_contition = df["NR/HL2"]==0
# Setting the default value
df.loc[:,"NR/HL"] = 0
# Setting the values deppeding on tje conditions
df.loc[~hl1_equal_0_condition, "NR/HL"] = df["NR/HL1"]
df.loc[hl1_equal_0_condition, "NR/HL"] = df["NR/HL2"]
df.loc[hl1_equal_0_condition & hl2_equal_0_contition, "NR/HL"] = df["NR/HL3"]
输出:
SKU NR/HL1 NR/HL2 NR/HL3 NR/HL
0 123 10 20 0 10
1 456 0 30 20 30
2 567 0 0 40 40
3 890 10 20 50 10