Question

新手问题。

我的数据框如下所示：

      class   A         B
0         1  3.767809   11.016
1         1  2.808231    4.500
2         1  4.822522    1.008
3         2  5.016933   -3.636
4         2  6.036203   -5.220
5         2  7.234567   -6.696
6         2  5.855065   -7.272
7         4  4.116770   -8.208
8         4  2.628000  -10.296
9         4  1.539184  -10.728
10        3  0.875918  -10.116
11        3  0.569210   -9.072
12        3  0.676379   -7.632
13        3  0.933921   -5.436
14        3  0.113842   -3.276
15        3  0.367129   -2.196
16        1  0.968661   -1.980
17        1  0.160997   -2.736
18        1  0.469383   -2.232
19        1  0.410463   -2.340
20        1  0.660872   -2.484

我想获得类相同的组，例如：

class 1: rows 0..2
class 2: rows 3..6
class 4: rows 7..9
class 3: rows 10..15
class 1: rows 16..20

原因是订单很重要。在要求中，我认为课程4只能在1和2之间，如果在预测之后我们在4之后有2课，则应将其视为{ {1}}。

Answer 1

构建新的Para以识别组

df['group']=df['class'].diff().ne(0).cumsum()

df.groupby('group')['group'].apply(lambda x : x.index)
Out[106]: 
group
1                 Int64Index([0, 1, 2], dtype='int64')
2              Int64Index([3, 4, 5, 6], dtype='int64')
3                 Int64Index([7, 8, 9], dtype='int64')
4    Int64Index([10, 11, 12, 13, 14, 15], dtype='in...
5      Int64Index([16, 17, 18, 19, 20], dtype='int64')
Name: group, dtype: object

如何在熊猫中选择值范围？

1 个答案: