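I have the following pandas DataFrame and I'm trying to do a bit of cleaning. In my case, I receive the data for product 'a' in raw decimal form, whereas I need it as a percentage so that it matches the format of the other products.
How can I scale success_rate and market_penetration_rate by 100 in my DataFrame, only where product = a?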
import pandas as pd
df = pd.DataFrame({'product' : ['a', 'a', 'c', 'c', 'd', 'b', 'a', 'b', 'c'],
'success_rate' : [0.2, 1.0, 67.0, 71.5, 23.2, 71.0, 0.44, 59.3, 12.7],
'market_penetration_rate' : [0.82, 0.64, 77.5, 12.5, 22.5, 88.0, 0.34, 98.2, 87.4]})
+---------+--------------+-------------------------+
| product | success_rate | market_penetration_rate |
+---------+--------------+-------------------------+
| a       | 0.2          | 0.82                    |
| a       | 1            | 0.64                    |
| c       | 67           | 77.5                    |
| c       | 71.5         | 12.5                    |
| d       | 23.2         | 22.5                    |
| b       | 71           | 88                      |
| a       | 0.44         | 0.34                    |
| b       | 59.3         | 98.2                    |
| c       | 12.7         | 87.4                    |
+---------+--------------+-------------------------+
Answer 0 (score: 7)
In [7]: print(df.loc[df['product']=='a', ['market_penetration_rate', 'success_rate']] * 100)
   market_penetration_rate  success_rate
0                       82            20
1                       64           100
6                       34            44
Or, if you want to scale in place:
In [8]: df.loc[df['product']=='a', ['market_penetration_rate', 'success_rate']] *= 100
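For reference, here is a self-contained sketch of the in-place version using the DataFrame from the question. The names `mask` and `cols` are only illustrative and do not appear in the original answer.

```python
import pandas as pd

df = pd.DataFrame({
    'product': ['a', 'a', 'c', 'c', 'd', 'b', 'a', 'b', 'c'],
    'success_rate': [0.2, 1.0, 67.0, 71.5, 23.2, 71.0, 0.44, 59.3, 12.7],
    'market_penetration_rate': [0.82, 0.64, 77.5, 12.5, 22.5, 88.0, 0.34, 98.2, 87.4],
})

# Illustrative names: a boolean row selector and the columns to rescale.
mask = df['product'] == 'a'
cols = ['success_rate', 'market_penetration_rate']

# Multiply only the selected rows and columns; all other cells are untouched.
df.loc[mask, cols] *= 100
```

Because the selection goes through `.loc` with a boolean mask, the assignment writes back into `df` itself rather than into a temporary copy.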
Answer 1 (score: 1)
Try this:
df.apply(lambda row: row[['success_rate', 'market_penetration_rate']]*100 if row['product'] == 'a'
else row[['success_rate', 'market_penetration_rate']], axis=1)
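Note that `apply` returns a new object and leaves `df` unchanged, so the result has to be assigned somewhere to be kept. As a possible vectorized alternative (not taken from the answer above), one could build a per-row factor and multiply the two columns by it. The names `cols` and `factor` are assumptions made for this sketch.

```python
import pandas as pd

df = pd.DataFrame({
    'product': ['a', 'a', 'c', 'c', 'd', 'b', 'a', 'b', 'c'],
    'success_rate': [0.2, 1.0, 67.0, 71.5, 23.2, 71.0, 0.44, 59.3, 12.7],
    'market_penetration_rate': [0.82, 0.64, 77.5, 12.5, 22.5, 88.0, 0.34, 98.2, 87.4],
})

cols = ['success_rate', 'market_penetration_rate']

# Per-row factor: 100 for product 'a', 1 for every other product.
factor = df['product'].eq('a').map({True: 100, False: 1})

# Multiply each row of the two columns by its factor (aligned on the index)
# and write the result back into df.
df[cols] = df[cols].mul(factor, axis=0)
```

This avoids calling a Python lambda once per row, which tends to matter on larger frames.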