【发布时间】:2019-06-10 17:12:41
【问题描述】:
我有一个公司董事数据集。例如,对于 2005 年的 X 公司,他们有 3 名董事。因此,对于公司 x 在 2005 年有三个观察结果。每个董事都有一个唯一的 ID。现在我只想过滤那些今年董事和往年董事相同的观察结果(它们整体相同;如果今年的成员包括 1 个新成员和往年的 2 个老成员;我不想要这些观察结果)。每个董事都有一个唯一的 ID。此外,每家公司都有一个唯一的 ID,例如 ISIN。
只有一家公司的数据集看起来像这样 -
ISIN year DirectorName DirectorID
1 US9898171015 2006 Thomas (Tom) E Davin 2247441792
2 US9898171015 2006 Matthew (Matt) L Hyde 4842568996
3 US9898171015 2007 James (Jim) M Weber 3581636766
4 US9898171015 2007 Matthew (Matt) L Hyde 4842568996
5 US9898171015 2007 David (Dave) M DeMattei 759047198
6 US9898171015 2008 James (Jim) M Weber 3581636766
7 US9898171015 2008 Matthew (Matt) L Hyde 4842568996
8 US9898171015 2008 David (Dave) M DeMattei 759047198
9 US9898171015 2009 William (Bill) Milroy Barnum Jr 20462211719
10 US9898171015 2009 James (Jim) M Weber 3581636766
11 US9898171015 2009 Matthew (Matt) L Hyde 4842568996
12 US9898171015 2009 David (Dave) M DeMattei 759047198
13 US9898171015 2010 William (Bill) Milroy Barnum Jr 20462211719
14 US9898171015 2010 James (Jim) M Weber 3581636766
15 US9898171015 2010 Matthew (Matt) L Hyde 4842568996
16 US9898171015 2011 Sarah (Sally) Gaines McCoy 11434863691
17 US9898171015 2011 William (Bill) Milroy Barnum Jr 20462211719
18 US9898171015 2011 James (Jim) M Weber 3581636766
19 US9898171015 2011 Matthew (Matt) L Hyde 4842568996
20 US9898171015 2012 Sarah (Sally) Gaines McCoy 11434863691
21 US9898171015 2012 Ernest R Johnson 40425210975
22 US9898171015 2013 Sarah (Sally) Gaines McCoy 11434863691
23 US9898171015 2013 Ernest R Johnson 40425210975
24 US9898171015 2013 Travis D Smith 53006212569
25 US9898171015 2014 Sarah (Sally) Gaines McCoy 11434863691
26 US9898171015 2014 Ernest R Johnson 40425210975
27 US9898171015 2014 Travis D Smith 53006212569
28 US9898171015 2015 Kalen F Holmes 11051172801
29 US9898171015 2015 Sarah (Sally) Gaines McCoy 11434863691
30 US9898171015 2015 Ernest R Johnson 40425210975
31 US9898171015 2015 Travis D Smith 53006212569
32 US9898171015 2016 Sarah (Sally) Gaines McCoy 11434863691
33 US9898171015 2016 Ernest R Johnson 40425210975
34 US9898171015 2016 Travis D Smith 53006212569
35 US9898171015 2017 Sarah (Sally) Gaines McCoy 11434863691
36 US9898171015 2017 Scott Andrew Bailey 174000000000
37 US9898171015 2017 Ernest R Johnson 40425210975
38 US9898171015 2017 Travis D Smith 53006212569
我试过这些代码
endo <- ac %>%
group_by(ISIN) %>%
filter(DirectorID == lag (DirectorID, 1))
使用上面的代码后,我得到了以下结果。
ISIN year DirectorName DirectorID
1 US9898171015 2007 Matthew (Matt) L Hyde 4842568996
2 US9898171015 2008 James (Jim) M Weber 3581636766
3 US9898171015 2008 Matthew (Matt) L Hyde 4842568996
4 US9898171015 2008 David (Dave) M DeMattei 759047198
5 US9898171015 2009 James (Jim) M Weber 3581636766
6 US9898171015 2009 Matthew (Matt) L Hyde 4842568996
7 US9898171015 2009 David (Dave) M DeMattei 759047198
8 US9898171015 2010 William (Bill) Milroy Barnum Jr 20462211719
9 US9898171015 2010 James (Jim) M Weber 3581636766
10 US9898171015 2010 Matthew (Matt) L Hyde 4842568996
11 US9898171015 2011 William (Bill) Milroy Barnum Jr 20462211719
12 US9898171015 2011 James (Jim) M Weber 3581636766
13 US9898171015 2011 Matthew (Matt) L Hyde 4842568996
14 US9898171015 2012 Sarah (Sally) Gaines McCoy 11434863691
15 US9898171015 2013 Sarah (Sally) Gaines McCoy 11434863691
16 US9898171015 2013 Ernest R Johnson 40425210975
17 US9898171015 2014 Sarah (Sally) Gaines McCoy 11434863691
18 US9898171015 2014 Ernest R Johnson 40425210975
19 US9898171015 2014 Travis D Smith 53006212569
20 US9898171015 2015 Sarah (Sally) Gaines McCoy 11434863691
21 US9898171015 2015 Ernest R Johnson 40425210975
22 US9898171015 2015 Travis D Smith 53006212569
23 US9898171015 2016 Sarah (Sally) Gaines McCoy 11434863691
24 US9898171015 2016 Ernest R Johnson 40425210975
25 US9898171015 2016 Travis D Smith 53006212569
26 US9898171015 2017 Sarah (Sally) Gaines McCoy 11434863691
27 US9898171015 2017 Ernest R Johnson 40425210975
28 US9898171015 2017 Travis D Smith 53006212569
如果手动检查第一个数据(使用代码之前的数据),显然只有2007年和2008年; 2013 年和 2014 年,董事会组成相同。所以我只想要这些观察结果。
但是第二个数据(使用代码后的数据)没有产生预期的结果。
预期的结果在这里 -
ISIN year DirectorName DirectorID
1 US9898171015 2007 James (Jim) M Weber 3581636766
2 US9898171015 2007 Matthew (Matt) L Hyde 4842568996
3 US9898171015 2007 David (Dave) M DeMattei 759047198
4 US9898171015 2008 James (Jim) M Weber 3581636766
5 US9898171015 2008 Matthew (Matt) L Hyde 4842568996
6 US9898171015 2008 David (Dave) M DeMattei 759047198
7 US9898171015 2013 Sarah (Sally) Gaines McCoy 11434863691
8 US9898171015 2013 Ernest R Johnson 40425210975
9 US9898171015 2013 Travis D Smith 53006212569
10 US9898171015 2014 Sarah (Sally) Gaines McCoy 11434863691
11 US9898171015 2014 Ernest R Johnson 40425210975
12 US9898171015 2014 Travis D Smith 53006212569
感谢您的帮助。
【问题讨论】:
-
您可能需要提供可重现/代码示例以更清楚地说明。如果您有兴趣,这里有一些指导:stackoverflow.com/questions/5963269/…