Question

我正在尝试从系列中减去一个数据框（只有一列）。

df['newcol'] = df['Col A'] - df.filter(regex="INT : Q1")

但是，出现以下错误：

 Exception: Data must be 1-dimensional

df['Col A']成为系列。

df.filter(regex="INT : Q1")成为数据框

您能在这里提出建议吗？

df['Col A']
Out[126]: 
0       0.000000e+00
1       0.000000e+00
2       0.000000e+00
3       0.000000e+00
4       0.000000e+00
5       0.000000e+00
6       1.046203e+05
7      -8.081900e+02
.............
 .......


df.filter(regex="INT : Q1")
Out[127]: 
Quarter_incomeGroup_Final  INT : Q1_2018
  0                           0.000000e+00
  1                           4.997991e+05
  2                           7.915359e+04
  3                           4.837797e+04 
       .............
      ..............

Answer 1

使用Series.rsub进行右减，但输出不是新列，而是DataFrame：

df = df.filter(regex="INT : Q1").rsub(df['Col A'], axis=0)

示例：

df = pd.DataFrame({'INT : Q1 1':[4,5,4,5,5,4],
                   'INT : Q1 2':[7,8,9,4,2,3],
                   'INT : Q1 3':[1,3,5,7,1,0],
                   'Col A':[5,3,6,9,2,4]})

print (df)
   INT : Q1 1  INT : Q1 2  INT : Q1 3  Col A
0           4           7           1      5
1           5           8           3      3
2           4           9           5      6
3           5           4           7      9
4           5           2           1      2
5           4           3           0      4

df = df.filter(regex="INT : Q1").rsub(df['Col A'], axis=0)
print (df)
   INT : Q1 1  INT : Q1 2  INT : Q1 3
0           1          -2           4
1          -2          -5           0
2           2          -3           1
3           4           5           2
4          -3           0           1
5           0           1           4

如果要创建新列-filter可以返回一个或多个列，因此可能的解决方案是通过iloc选择第一列：

df['newcol'] = df.filter(regex="INT : Q1").rsub(df['Col A'], axis=0).iloc[:, 0]

从系列中减去一个数据框（或其唯一的列）

1 个答案: