【发布时间】:2017-07-12 13:52:26
【问题描述】:
我有许多不同作家的活动数据,数据包括他们写作生涯的start.date 和end.date
library("tidyverse")
writing_period_data <- tribble(
~start.date, ~end.date, ~writer, ~topic,
12, 18, "a", sample(letters[10:20],1),
14, 20, "b", sample(letters[10:20],1),
17, 22, "c", sample(letters[10:20],1),
15, 30, "a", sample(letters[10:20],1)
)
我想最终创建这个数据的joyplot,这需要我生成这个数据结构:
desired_output <- tribble(
~year, ~count, ~writer,
12, 1, "a",
13, 1, "a",
14, 1, "a",
14, 1, "b",
15, 2, "a",
15, 1, "b",
16, 2, "a",
16, 1, "b",
17, 2, "a",
17, 1, "b",
17, 1, "c",
18, 2, "a",
18, 1, "b",
18, 1, "c",
19, 1, "a",
19, 1, "b",
19, 1, "c",
20, 1, "a",
20, 1, "b",
20, 1, "c",
21, 1, "a",
21, 1, "c",
22, 1, "a",
22, 1, "c",
23, 1, "a",
24, 1, "a"
)
我们可以从这张图表中看到作者在感兴趣的时间段内的分布:
desired_output %>%
ggplot(aes(x = year, y = count, fill = writer)) + geom_col()
如何从writing_period_data 生成desired_output?
【问题讨论】: