Question

数据框如下所示：

temp = pd.DataFrame({'tic': ['IBM', 'AAPL', 'AAPL', 'IBM', 'AAPL'],
                   'industry': ['A', 'B', 'B', 'A', 'B'],
                    'price': [np.nan, 5, 6, 11, np.nan],
                    'shares':[100, 60, np.nan, 100, np.nan],
                    'dates': pd.to_datetime(['1990-01-01', '1990-01-01', '1990-04-01', 
                                                 '1990-04-01', '1990-08-01'])
                    })
temp.set_index(['tic', 'dates'], inplace=True)
temp.sort_index(inplace=True)

Input [415]: temp
Out[415]: 
                industry  price  shares  
tic  dates                                  
AAPL 1990-01-01        B    5.0    60.0    
     1990-04-01        B    6.0     NaN    
     1990-08-01        B    NaN     NaN    
IBM  1990-01-01        A    NaN   100.0    
     1990-04-01        A   11.0   100.0

我想删除tic s NaN s（组中的所有行）share s。我怎样才能进行过滤：

放弃抽搐：

if＆gt;列share中的n％值缺失。
if＆gt;列price或share中缺少n％的值
如果缺少列toDSum中的n个SUCCESSIVE值

如何按组计算NaNs的百分比？

0 个答案: