计算group_by中行之间的差异分数

时间:2019-02-26 06:14:04

标签: r dplyr difference

我正在处理一些比赛数据,并希望计算出每场比赛每个团队之间的目标差异。

我可以得到第二支球队的得分差(在“差异”列中),但是我无法计算出如何计算第一支球队的进球差。这应该是第二支球队的目标差的倒数(即,在示例数据集中,差异列表中的“咆哮者”应该有1,而“罢工”应该有-1)。

library(dplyr)

dat <-
  structure(
    list(
      Match = c(1, 1, 2, 2, 3, 3),
      Team = c("Growlers",
               "Rollers", "Strike", "Bandits", "Cats", "Blues"),
      Goals = c(1,0, 0, 1, 1, 2)
    ),
    row.names = c(NA,-6L),
    groups = structure(
      list(
        Match = c(895825, 895826, 895827),
        .rows = list(1:2, 3:4,
                     5:6)
      ),
      row.names = c(NA,-3L),
      class = c("tbl_df", "tbl",
                "data.frame"),
      .drop = TRUE
    ),
    class = c("grouped_df", "tbl_df",
              "tbl", "data.frame")
  )

dat %>% 
    group_by(Match) %>% 
    mutate(diff = Goals - lag(Goals))
#> # A tibble: 6 x 4
#> # Groups:   Match [3]
#>   Match Team     Goals  diff
#>   <dbl> <chr>    <dbl> <dbl>
#> 1     1 Growlers     1    NA
#> 2     1 Rollers      0    -1
#> 3     2 Strike       0    NA
#> 4     2 Bandits      1     1
#> 5     3 Cats         1    NA
#> 6     3 Blues        2     1

reprex package(v0.2.0)于2019-02-26创建。

1 个答案:

答案 0 :(得分:1)

一种快速而肮脏的方法是显式计算团队1和团队2的得分,如下所示:

dat %>% 
  group_by(Match) %>% 
  mutate(
    diff = c(
      Goals[1] - Goals[2],
      Goals[2] - Goals[1] 
    )
  )

#> # A tibble: 6 x 4
#> # Groups:   Match [3]
#>   Match Team     Goals  diff
#>   <dbl> <chr>    <dbl> <dbl>
#> 1     1 Growlers     1     1
#> 2     1 Rollers      0    -1
#> 3     2 Strike       0    -1
#> 4     2 Bandits      1     1