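【Posted】: 2020-06-05 10:53:56
【Problem description】:
I have a dataset (df1) that contains multiple pairs of rows. Within each pair, one row is for one year (e.g., 2014) and the other row is for a different year (e.g., 2013). Each row has a value in column G. I need to count the pairs in which the G value for the later year is less than the G value for the earlier year.
Here is my dput of the dataset df1: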
structure(list(Name = c("A.J. Ellis", "A.J. Ellis", "A.J. Pierzynski",
"A.J. Pierzynski", "Aaron Boone", "Adam Kennedy", "Adam Melhuse",
"Adrian Beltre", "Adrian Beltre", "Adrian Gonzalez", "Alan Zinter",
"Albert Pujols", "Albert Pujols"), Age = c(37, 36, 37, 36, 36,
36, 36, 37, 36, 36, 36, 37, 36), Year = c(2018, 2017, 2014, 2013,
2009, 2012, 2008, 2016, 2015, 2018, 2004, 2017, 2016), Tm = c("SDP",
"MIA", "TOT", "TEX", "HOU", "LAD", "TOT", "TEX", "TEX", "NYM",
"ARI", "LAA", "LAA"), Lg = c("NL", "NL", "ML", "AL", "NL", "NL",
"ML", "AL", "AL", "NL", "NL", "AL", "AL"), G = c(66, 51, 102,
134, 10, 86, 15, 153, 143, 54, 28, 149, 152), PA = c(183, 163,
362, 529, 14, 201, 32, 640, 619, 187, 40, 636, 650)), row.names = c(NA,
13L), class = "data.frame")
Here is a tibble showing what the rows to be checked look like: https://www.dropbox.com/s/3nbfi9le568qb3s/grouped-pairs.png?dl=0
Here is the code I used to create the tibble:
library(dplyr)
# Keep only the names that appear more than once, i.e. the paired rows
df1 %>%
  group_by(Name) %>%
  filter(n() > 1)
【Discussion】:
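One possible way to get the count described in the question, as a minimal sketch. It assumes dplyr is available and that every name with more than one row forms a single pair. The data frame below repeats only the Name, Year, and G columns from the question's df1, and n_declined is an illustrative name that does not appear in the original post.

```r
library(dplyr)

# Subset of the question's df1: only the columns needed for the comparison.
df1 <- data.frame(
  Name = c("A.J. Ellis", "A.J. Ellis", "A.J. Pierzynski", "A.J. Pierzynski",
           "Aaron Boone", "Adam Kennedy", "Adam Melhuse", "Adrian Beltre",
           "Adrian Beltre", "Adrian Gonzalez", "Alan Zinter", "Albert Pujols",
           "Albert Pujols"),
  Year = c(2018, 2017, 2014, 2013, 2009, 2012, 2008, 2016, 2015, 2018, 2004,
           2017, 2016),
  G = c(66, 51, 102, 134, 10, 86, 15, 153, 143, 54, 28, 149, 152)
)

n_declined <- df1 %>%
  group_by(Name) %>%
  filter(n() > 1) %>%                 # keep only the paired rows
  # TRUE when G in the later year is below G in the earlier year
  summarise(declined = G[which.max(Year)] < G[which.min(Year)]) %>%
  summarise(n = sum(declined)) %>%    # count the TRUE pairs
  pull(n)

n_declined
# 2 for this data (A.J. Pierzynski and Albert Pujols)
```

Using which.max(Year) and which.min(Year) avoids relying on the row order within each pair, so the comparison should still hold if the rows are not sorted by year.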