我正在处理一些比赛数据,并希望计算出每场比赛每个团队之间的目标差异。
我可以得到第二支球队的得分差(在“差异”列中),但是我无法计算出如何计算第一支球队的进球差。这应该是第二支球队的目标差的倒数(即,在示例数据集中,差异列表中的“咆哮者”应该有1
,而“罢工”应该有-1
)。
library(dplyr)
dat <-
structure(
list(
Match = c(1, 1, 2, 2, 3, 3),
Team = c("Growlers",
"Rollers", "Strike", "Bandits", "Cats", "Blues"),
Goals = c(1,0, 0, 1, 1, 2)
),
row.names = c(NA,-6L),
groups = structure(
list(
Match = c(895825, 895826, 895827),
.rows = list(1:2, 3:4,
5:6)
),
row.names = c(NA,-3L),
class = c("tbl_df", "tbl",
"data.frame"),
.drop = TRUE
),
class = c("grouped_df", "tbl_df",
"tbl", "data.frame")
)
dat %>%
group_by(Match) %>%
mutate(diff = Goals - lag(Goals))
#> # A tibble: 6 x 4
#> # Groups: Match [3]
#> Match Team Goals diff
#> <dbl> <chr> <dbl> <dbl>
#> 1 1 Growlers 1 NA
#> 2 1 Rollers 0 -1
#> 3 2 Strike 0 NA
#> 4 2 Bandits 1 1
#> 5 3 Cats 1 NA
#> 6 3 Blues 2 1
由reprex package(v0.2.0)于2019-02-26创建。
答案 0 :(得分:1)
一种快速而肮脏的方法是显式计算团队1和团队2的得分,如下所示:
dat %>%
group_by(Match) %>%
mutate(
diff = c(
Goals[1] - Goals[2],
Goals[2] - Goals[1]
)
)
#> # A tibble: 6 x 4
#> # Groups: Match [3]
#> Match Team Goals diff
#> <dbl> <chr> <dbl> <dbl>
#> 1 1 Growlers 1 1
#> 2 1 Rollers 0 -1
#> 3 2 Strike 0 -1
#> 4 2 Bandits 1 1