How can I copy data from one column to another based on a condition in R?

Time: 2018-03-06 13:36:51

Tags: r data-science data-cleaning

My data frame looks like this.

  Funct.Area Environment ServiceType Ticket.Nature SLA.Result..4P. IRIS.Priority Func_Environment
2        FUN         DCF         FUN            SR              OK        Medium          FUN-DCF
3  AME - FIN         DCF         FUN            SR          Defect        Medium    AME - FIN-DCF
4  EMEA -FIN         DCF         FUN            SR              OK        Medium    EMEA -FIN-DCF
5        APS         DCF         APS            SR          Defect        Medium          APS-DCF
6   EMEA -SC         DCF         FUN            SR              OK        Medium     EMEA -SC-DCF
7        SEC         DCF         SEC            SR              OK           Low          SEC-DCF

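I need to look at the first 3 or 4 characters of the Funct.Area field and, if they are "EMEA" or "AME", replace the value in that field with the value from another column of the same row.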

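I looked at similar questions on StackOverflow, but I ran into a problem with factors and could not get the copying part to work.

Could someone point me to an approach I could try?

Thanks.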

Edit #1: Following @akrun's approach.

Environment

Edit #2:

This is the solution that worked for me.

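# Convert the factor column to character so string values can be assigned to it
tickets$Funct.Area <- as.character(tickets$Funct.Area)
# TRUE for rows whose Funct.Area starts with "EMEA" or "AME"
i1 <- grepl("^(EMEA|AME)", tickets$Funct.Area)
# Copy Func_Environment into Funct.Area for the matching rows only
tickets$Funct.Area[i1] <- tickets$Func_Environment[i1]
head(tickets)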
  Funct.Area Environment ServiceType Ticket.Nature SLA.Result..4P. IRIS.Priority Func_Environment
2        FUN         DCF         FUN            SR              OK        Medium          FUN-DCF
3         13         DCF         FUN            SR          Defect        Medium    AME - FIN-DCF
4         65         DCF         FUN            SR              OK        Medium    EMEA -FIN-DCF
5        APS         DCF         APS            SR          Defect        Medium          APS-DCF
6         66         DCF         FUN            SR              OK        Medium     EMEA -SC-DCF
7        SEC         DCF         SEC            SR              OK           Low          SEC-DCF
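
The values 13, 65 and 66 in the output above look like integer factor codes rather than text. That is what R produces when a factor is assigned into a character vector, so it would be consistent with Func_Environment still being a factor (an assumption, since the structure of tickets is not shown). A minimal sketch of converting both columns first, using a hypothetical three-row stand-in called tickets_demo:

```r
# Hypothetical stand-in for `tickets`; both columns are deliberately factors,
# as they would be after an import with stringsAsFactors = TRUE.
tickets_demo <- data.frame(
  Funct.Area       = c("FUN", "AME - FIN", "EMEA -SC"),
  Func_Environment = c("FUN-DCF", "AME - FIN-DCF", "EMEA -SC-DCF"),
  stringsAsFactors = TRUE
)

# Logical index: TRUE where Funct.Area starts with "EMEA" or "AME"
# (grepl() coerces the factor to character for matching).
i1 <- grepl("^(EMEA|AME)", tickets_demo$Funct.Area)

# Problem case: only the target is character, the source is still a factor,
# so the factor's integer codes are what get copied.
codes_copied <- as.character(tickets_demo$Funct.Area)
codes_copied[i1] <- tickets_demo$Func_Environment[i1]

# Fix: convert BOTH columns to character before copying.
tickets_demo$Funct.Area       <- as.character(tickets_demo$Funct.Area)
tickets_demo$Func_Environment <- as.character(tickets_demo$Func_Environment)
tickets_demo$Funct.Area[i1]   <- tickets_demo$Func_Environment[i1]

print(codes_copied)             # matching rows hold codes, not text
print(tickets_demo$Funct.Area)  # matching rows hold the Func_Environment text
```

If the data comes from read.csv(), passing stringsAsFactors = FALSE at import time is another way to avoid the conversion step.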

Many thanks, @akrun.

1 answer:

Answer 0: (score: 0)

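We use grepl to create a logical index ('i1') that checks whether the characters at the start (^) of the string are 'EMEA' or (|) 'AME'. Using that index, we replace the matching elements of 'Funct.Area' with the corresponding elements of 'Func_Environment'.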

i1 <- with(df1, grepl("^(EMEA|AME)", Funct.Area))
df1$Funct.Area[i1] <- df1$Func_Environment[i1] 
df1
#     Funct.Area Environment ServiceType Ticket.Nature SLA.Result..4P. IRIS.Priority Func_Environment
#2           FUN         DCF         FUN            SR              OK        Medium          FUN-DCF
#3 AME - FIN-DCF         DCF         FUN            SR          Defect        Medium    AME - FIN-DCF
#4 EMEA -FIN-DCF         DCF         FUN            SR              OK        Medium    EMEA -FIN-DCF
#5           APS         DCF         APS            SR          Defect        Medium         APS-DCF'
#6  EMEA -SC-DCF         DCF         FUN            SR              OK        Medium     EMEA -SC-DCF
#7           SEC         DCF         SEC            SR              OK           Low          SEC-DCF
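
The same replacement can also be written as a single ifelse() call. This is an alternative sketch, not part of the answer above; df_demo is a hypothetical cut-down copy of df1 with character columns:

```r
# Hypothetical cut-down copy of df1 (character columns, not factors).
df_demo <- data.frame(
  Funct.Area       = c("FUN", "AME - FIN", "EMEA -FIN", "APS"),
  Func_Environment = c("FUN-DCF", "AME - FIN-DCF", "EMEA -FIN-DCF", "APS-DCF"),
  stringsAsFactors = FALSE
)

# Logical index: TRUE where Funct.Area starts with "EMEA" or "AME".
i1 <- grepl("^(EMEA|AME)", df_demo$Funct.Area)

# Element-wise choice: take Func_Environment where i1 is TRUE,
# otherwise keep the existing Funct.Area value.
df_demo$Funct.Area <- ifelse(i1, df_demo$Func_Environment, df_demo$Funct.Area)

print(i1)
print(df_demo$Funct.Area)
```

The indexed assignment in the answer changes only the matching rows in place, while ifelse() rebuilds the whole column; on character columns both give the same result.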

Data

df1 <- structure(list(Funct.Area = c("FUN", "AME - FIN", "EMEA -FIN", 
"APS", "EMEA -SC", "SEC"), Environment = c("DCF", "DCF", "DCF", 
"DCF", "DCF", "DCF"), ServiceType = c("FUN", "FUN", "FUN", "APS", 
"FUN", "SEC"), Ticket.Nature = c("SR", "SR", "SR", "SR", "SR", 
"SR"), SLA.Result..4P. = c("OK", "Defect", "OK", "Defect", "OK", 
"OK"), IRIS.Priority = c("Medium", "Medium", "Medium", "Medium", 
"Medium", "Low"), Func_Environment = c("FUN-DCF", "AME - FIN-DCF", 
 "EMEA -FIN-DCF", "APS-DCF'", "EMEA -SC-DCF", "SEC-DCF")), .Names = c("Funct.Area", 
 "Environment", "ServiceType", "Ticket.Nature", "SLA.Result..4P.", 
"IRIS.Priority", "Func_Environment"), class = "data.frame", 
 row.names = c("2", 
 "3", "4", "5", "6", "7"))