以下是我尝试做的最小例子。我有一个带有多索引的pandas DataFrame,如下所示
import pandas as pd
import numpy as np
arrays = [['bar', 'bar', 'baz', 'baz', 'foo', 'foo', 'qux', 'qux'],
['one', 'two', 'one', 'two', 'one', 'two', 'one', 'two']]
tuples = list(zip(*arrays))
index = pd.MultiIndex.from_tuples(tuples, names=['first', 'second'])
s = pd.DataFrame(np.random.randn(8,2), index=index)
所以我拥有的DataFrame是
0 1
first second
bar one -3.174428 -0.314160
two 0.968316 0.278967
baz one 0.171292 -0.789257
two 1.420621 0.100964
foo one -1.001074 -0.517729
two -0.211823 0.951422
qux one 1.173289 0.313692
two -0.159855 0.149710
我想要的是用索引"第二个"设置所有观察结果。等于2为-1。我想到的是使用.loc,如下所示:
s.loc[(:,'two')]
但.loc不会接受":"操作
有人可以帮忙吗?
答案 0 :(得分:2)
选项1:
In [127]: s.loc[pd.IndexSlice[:, 'two'], :] = -1
In [128]: s
Out[128]:
0 1
first second
bar one -0.581647 0.225254
two -1.000000 -1.000000
baz one 0.705050 -1.414695
two -1.000000 -1.000000
foo one 0.359795 1.468521
two -1.000000 -1.000000
qux one -0.481149 -0.241922
two -1.000000 -1.000000
选项2:
In [137]: s.loc[(slice(None),'two'), :] = -11
In [138]: s
Out[138]:
0 1
first second
bar one 2.144487 0.024400
two -11.000000 -11.000000
baz one -0.177128 -1.088566
two -11.000000 -11.000000
foo one -0.780979 2.701814
two -11.000000 -11.000000
qux one -0.981635 -0.202875
two -11.000000 -11.000000