Question

我有一个包含“Col1”列的表，如下所示：

| Col1 |

| 2 |

| 4 |

| 3 |

我需要创建一个新列“Col2”。此后的表应如下所示：

| Col1 | Col2 |

| 2 | 1 |

| 2 | 2 |

| 4 | 1 |

| 4 | 2 |

| 4 | 3 |

| 4 | 4 |

| 3 | 1 |

| 3 | 2 |

| 3 | 3 |

| 3 | 1 |

| 3 | 2 |

| 3 | 3 |

是否可以这样做，如果我连续有相同的值，代码从1开始？例如3。

| 3 | 1 |

| 3 | 2 |

| 3 | 3 |

| 3 | 1 |

| 3 | 2 |

| 3 | 3 |

Answer 1

让我们尝试这个pandas解决方案，而不需要循环：

df2 = df.assign(Col2=df.groupby('Col1')['Col1'].cumcount().mod(df['Col1']).add(1))
print(df2)

输出：

    Col1  Col2
0      2     1
1      2     2
2      4     1
3      4     2
4      4     3
5      4     4
6      3     1
7      3     2
8      3     3
9      3     1
10     3     2
11     3     3

Answer 2

import pandas as pd
df = pd.DataFrame({'Col1':[2,2,4,4,4,4,3,3,3,3,3,3]})
i = 0
Col2 = []
Col1 = df.Col1

#Construct Col2 
while i < (len(Col1)):
    Col2.extend(list(range(1,Col1[i]+1)))
    i = len(Col2)
#Add Col2 to Dataframe
df['Col2'] = Col2

大熊猫中的一些计算，添加了一列

2 个答案: