【发布时间】:2018-07-08 14:38:23
【问题描述】:
这篇文章与上一篇文章transform date into dummy variable in R相关,但更复杂。 我有数据
df=structure(list(Data = structure(c(4L, 5L, 6L, 7L, 8L, 9L, 10L,
1L, 2L, 3L), .Label = c("01.01.2018", "02.01.2018", "03.01.2018",
"25.12.2017", "26.12.2017", "27.12.2017", "28.12.2017", "29.12.2017",
"30.12.2017", "31.12.2017"), class = "factor"), Y = 1:10), .Names = c("Data",
"Y"), class = "data.frame", row.names = c(NA, -10L))
我必须将日期转换为虚拟变量。如果日期指的是这个日期,则为 1,否则为 0。
Paweł Kozielski-Romaneczko 提供的解决方案帮助了我。
library(dplyr)
library(lubridate)
library(tidyr)
df %>%
mutate(weekDay = lubridate::dmy(Data) %>% weekdays(),
value = 1) %>%
spread(key=weekDay, value=value, fill=0)
但是现在,我必须添加带有假期的列。 IE。约会是不是假期?
我有辅助数据集,其中指示日期是假期?
df1=structure(list(Data = structure(1:2, .Label = c("01.01.2018",
"08.03.2018"), class = "factor"), name = structure(c(2L, 1L), .Label = c("International Women's Day",
"New Year"), class = "factor")), .Names = c("Data", "name"), class = "data.frame", row.names = c(NA,
-2L))
所以作为输出我需要这个假期
Data Y Mon Tue Wed Thu Fri Sat Sun New Year International Women's Day
25.12.2017 1 1 0 0 0 0 0 0 0 0
26.12.2017 2 0 1 0 0 0 0 0 0 0
27.12.2017 3 0 0 1 0 0 0 0 0 0
28.12.2017 4 0 0 0 1 0 0 0 0 0
29.12.2017 5 0 0 0 0 1 0 0 0 0
30.12.2017 6 0 0 0 0 0 1 0 0 0
31.12.2017 7 0 0 0 0 0 0 1 0 0
01.01.2018 8 1 0 0 0 0 0 0 1 0
02.01.2018 9 0 1 0 0 0 0 0 0 0
03.01.2018 10 0 0 1 0 0 0 0 0 0
如何将假期添加为虚拟变量,名称取自辅助数据集?
附:如果你认为这个话题一定在我上一篇文章中,请告诉我,我会删除它。
【问题讨论】:
标签: r dataframe dplyr lubridate