Assign unique values to duplicate rows

Time: 2017-02-14 22:30:47

Tags: r

I want to assign a value to each duplicated row, grouped by ID, in R:
df <- data.frame(ID=c(1,1,1,2,2,2,2,2,3,3,4),
            Code = c("A","A","A","B","B","C","C","D","A","A","C"))
> df
   ID Code
1   1    A
2   1    A
3   1    A
4   2    B
5   2    B
6   2    C
7   2    C
8   2    D
9   3    A
10  3    A
11  4    C

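I want the output to look like this: within each ID, check for repeated Code values, leave the first occurrence unchanged, then append _1 to the second occurrence, _2 to the third, and so on.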

   ID Code Code_n
1   1    A      A
2   1    A    A_1
3   1    A    A_2
4   2    B      B
5   2    B    B_1
6   2    C      C
7   2    C    C_1
8   2    D      D
9   3    A      A
10  3    A    A_1
11  4    C      C

2 answers:

Answer 0 (score: 9)

You can use make.unique from base R, applied within each ID via ave, as follows:

with(df, ave(as.character(Code), ID, FUN = make.unique))
#[1] "A"   "A.1" "A.2" "B"   "B.1" "C"   "C.1" "D"   "A"   "A.1" "C"
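
Note that the call above separates the suffix with a dot ("A.1"), while the question asks for an underscore ("A_1"). A minimal sketch of one way to close that gap, assuming the `sep` argument of `make.unique` is acceptable here and that the result should be stored in a new column named `Code_n` as in the desired output:

```r
# Sample data from the question
df <- data.frame(ID   = c(1, 1, 1, 2, 2, 2, 2, 2, 3, 3, 4),
                 Code = c("A", "A", "A", "B", "B", "C", "C", "D", "A", "A", "C"))

# Within each ID, make the Code values unique; sep = "_" replaces the default "."
df$Code_n <- with(df, ave(as.character(Code), ID,
                          FUN = function(x) make.unique(x, sep = "_")))

print(df)
```

The anonymous function is only there to pass `sep` through to `make.unique`; `as.character` keeps the call safe if `Code` is a factor, as it would be by default in R versions before 4.0.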

Answer 1 (score: 1)

Or use dplyr:

library(dplyr)
df %>% 
    group_by(ID) %>% 
    mutate(Code_n = make.unique(as.character(Code)))
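
As written, the pipeline returns a grouped tibble and uses the default "." separator. A hedged variant, assuming dplyr is installed, that uses the underscore suffix from the question and drops the grouping afterwards (the object name `out` is only illustrative):

```r
library(dplyr)

# Sample data from the question
df <- data.frame(ID   = c(1, 1, 1, 2, 2, 2, 2, 2, 3, 3, 4),
                 Code = c("A", "A", "A", "B", "B", "C", "C", "D", "A", "A", "C"))

out <- df %>%
    group_by(ID) %>%                                        # number duplicates per ID
    mutate(Code_n = make.unique(as.character(Code), sep = "_")) %>%
    ungroup()                                               # remove grouping for later steps

print(out)
```

Calling `ungroup()` is optional, but it avoids later operations being silently applied per ID.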