如何在Rbased另一个表格中重新考虑因素

时间:2017-03-08 00:51:07

标签: r dplyr factors

我尝试使用forcats::fct_relevel()来重新定位mydf中的因子以匹配查找中因子的顺序:

library(dplyr)
library(forcats)

mydf <- data.frame(id=1:10, f=as.factor(c("feb", "mar","mar","mar","jan","jan", "apr", "apr","mar", "jan")))

lookup <- data.frame(f=as.factor(c("jan", "feb", "mar", "apr"))) #the order i want

levels(mydf$f) #defaults to alphabetical

mydf %>% group_by(f) %>% dplyr::summarise(n = n()) #want this in the order of lookup$f

mydf %>% 
  mutate(f = fct_relevel(f, levels(mydf$f))) %>% #the problem line
  group_by(f) %>% dplyr::summarise(n = n())

我知道我可以在fct_relevel()内手动定义级别,但我无法每次都对其进行硬编码。感谢。

1 个答案:

答案 0 :(得分:0)

我的第一个倾向是在进行任何分析之前解决它:

levels(mydf$f) <- lookup$f

...您可以将其嵌入您的管道通话中,例如:

mydf %>% mutate(f = factor(f, levels=lookup$f)) %>% group_by(f) %>% dplyr::summarise(n = n())

否则你可以用一种笨重的方式使用lvls_reorder

mydf %>% 
  mutate(f = lvls_reorder(f, match(lookup$f, levels(f)))) %>%
  group_by(f) %>%
  dplyr::summarise(n = n())