我正在尝试使用pandas样式突出显示数据框中某些列中的某些值:
import pandas as pd
import numpy as np
np.random.seed(24)
df = pd.DataFrame({'A': np.linspace(1, 10, 10)})
df = pd.concat([df, pd.DataFrame(np.random.randn(10, 4),
columns=list('BCDE'))],axis=1)
df.iloc[0, 2] = np.nan
def highlight_greater(row):
color=""
if row['B'] > row['C']:
color = 'red'
elif row['D'] > row['E']:
color = 'gray'
background = ['background-color: {}'.format(color) for _ in row]
return background
with open ('out.html','w') as out:
print >> out, df.style.apply(highlight_greater, axis=1).render()
这很好,但与我的objectif并不对应,我只想突出显示B和D列。如果匹配条件,此脚本将突出显示该行中的所有列。 任何想法 ?谢谢
答案 0 :(得分:1)
您可以更改样式的DataFrame的自定义功能:
def highlight_greater(x):
r = 'red'
g = 'gray'
m1 = x['B'] > x['C']
m2 = x['D'] > x['E']
df1 = pd.DataFrame('background-color: ', index=x.index, columns=x.columns)
#rewrite values by boolean masks
df1['B'] = np.where(m1, 'background-color: {}'.format(r), df1['B'])
df1['D'] = np.where(m2, 'background-color: {}'.format(g), df1['D'])
return df1
df.style.apply(highlight_greater, axis=None)