I'm trying to use forcats::fct_relevel() to reorder the levels of the factor in mydf so that they match the order of the factor in lookup:
library(dplyr)
library(forcats)
mydf <- data.frame(id=1:10, f=as.factor(c("feb", "mar","mar","mar","jan","jan", "apr", "apr","mar", "jan")))
lookup <- data.frame(f=as.factor(c("jan", "feb", "mar", "apr"))) # the order I want (row order; levels(lookup$f) is alphabetical too)
levels(mydf$f) #defaults to alphabetical
mydf %>% group_by(f) %>% dplyr::summarise(n = n()) #want this in the order of lookup$f
mydf %>%
  mutate(f = fct_relevel(f, levels(mydf$f))) %>% #the problem line
  group_by(f) %>% dplyr::summarise(n = n())
I know I can define the levels manually inside fct_relevel(), but I can't hard-code them every time. Thanks.
Answer 0 (score: 0)
My first inclination would be to fix it before doing any analysis:
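# factor() reorders the levels; assigning to levels() would rename them instead
mydf$f <- factor(mydf$f, levels = as.character(lookup$f))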
...or you can embed it in your pipeline, for example:
mydf %>% mutate(f = factor(f, levels = as.character(lookup$f))) %>% group_by(f) %>% dplyr::summarise(n = n())
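A quick base R check may help here, since reordering levels and relabelling them are easy to confuse. The sketch below reuses the data from the question; the names `reordered` and `relabelled` are only illustrative. It compares `factor(x, levels = ...)`, which keeps every value and changes only the level order, with assignment to `levels()`, which renames the existing levels position by position.

```r
# Same data as in the question.
mydf <- data.frame(id = 1:10,
                   f = as.factor(c("feb", "mar", "mar", "mar", "jan", "jan",
                                   "apr", "apr", "mar", "jan")))
lookup <- data.frame(f = as.factor(c("jan", "feb", "mar", "apr")))

# Reordering: each row keeps its value, only the level order changes.
reordered <- factor(mydf$f, levels = as.character(lookup$f))
levels(reordered)
table(reordered)

# Relabelling: the existing alphabetical levels (apr, feb, jan, mar) are
# renamed in place, so the values attached to the rows change.
relabelled <- mydf$f
levels(relabelled) <- as.character(lookup$f)
table(relabelled)
```

If the two tables disagree, as they should with this data, that is a sign the `levels()` assignment altered the values rather than just the ordering.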
Otherwise you can use lvls_reorder, which is somewhat clunky:
mydf %>%
  mutate(f = lvls_reorder(f, match(lookup$f, levels(f)))) %>%
  group_by(f) %>%
  dplyr::summarise(n = n())
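Since the question asked about fct_relevel() specifically, here is a minimal sketch of how it could be used without hard-coding. The problem line passes levels(mydf$f), which is already the current order, so nothing moves. The sketch assumes the desired order is the row order of lookup$f, and the helper name `wanted` is hypothetical.

```r
library(dplyr)
library(forcats)

mydf <- data.frame(id = 1:10,
                   f = as.factor(c("feb", "mar", "mar", "mar", "jan", "jan",
                                   "apr", "apr", "mar", "jan")))
lookup <- data.frame(f = as.factor(c("jan", "feb", "mar", "apr")))

# levels(lookup$f) is alphabetical as well, so take the values in row order.
wanted <- as.character(lookup$f)

result <- mydf %>%
  mutate(f = fct_relevel(f, wanted)) %>%  # move the levels into the wanted order
  group_by(f) %>%
  summarise(n = n())
result
```

The summary rows should then follow the order jan, feb, mar, apr. If lookup could contain values that are not levels of mydf$f, factor(f, levels = wanted) may be the more forgiving choice.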