【问题标题】:ggplot: how to break the x-axis into months when data points are per week?ggplot:当数据点是每周时,如何将 x 轴分成几个月?
【发布时间】:2021-06-15 03:55:12
【问题描述】:

问题:当 x 轴基于连续 200 周时,如何使 x 轴更具可读性?我打算把 x 轴分成几个月。问题是一周的第一天不一定与一个月的第一天匹配。因此,我不知道如何处理重叠(连续两个月放在同一周)。

我正在想象 Covid-19 之前和之后的外科手术。

x 轴对应于自星期一以来的连续周 2017-01-02 (yyyy-mm-dd),范围为 1 - 209。每个geom_point() 对应每周的手术次数。

通常,我会简单地将 x 轴分成更小的范围,例如x 轴中断对应于 3 个月。不幸的是,由于b$cons_week 计算自2017-01-02 以来经过的每个连续星期一,它不一定对应于“月休”(因为一个月的第一天不一定与一周的第一天重合)。因此,我不知道如何打破 x 轴。

我的数据是这样的

> head(b)
# A tibble: 6 x 3
  diagnosis  cons_week corona
  <chr>          <dbl> <fct> 
1 2017-10-19        42 Normal
2 2017-07-11        28 Normal
3 2020-06-30       183 C19   
4 2020-06-27       182 C19   
5 2017-01-04         1 Normal
6 2017-12-07        49 Normal

首先,我计算每周的手术次数:

lin.model <- b %>% 
  group_by(corona, cons_week) %>%
  summarise(n = n()) 

这样

# A tibble: 80 x 3
# Groups:   corona [2]
   corona cons_week     n
   <fct>      <dbl> <int>
 1 C19          173     1
 2 C19          175     1
 3 C19          181     1
 4 C19          182     2

然后

ggplot(lin.model,
       aes(x = cons_week, y = n, color = corona, fill = corona)) +
  geom_point(size = 5, shape = 21) +
  geom_smooth(se = F, method = lm, color = "black", show.legend = F) +
  geom_smooth(lty = 2, show.legend = F) + 
  scale_color_manual(name = "",
                     values = c("#8B3A62", "#6DBCC3"),
                     labels = c("COVID-19", "Normal"),
                     guide = guide_legend(reverse=TRUE)) + 
  scale_fill_manual(name = "",
                    values = alpha(c("#8B3A62", "#6DBCC3"), .25),
                    labels = c("COVID-19", "Normal"),
                    guide = guide_legend(reverse=TRUE)) + 
  scale_x_continuous(name = "",
                     breaks = seq(0, 210, 12)) + 
  scale_y_continuous(name = "",
                     breaks = seq(0, 30, 5), limits = c(0, 30)) + 
  theme(axis.title.y = element_text(color = "grey20", 
                                    size = 17, 
                                    face="bold", 
                                    margin=ggplot2::margin(r=10)),
        axis.line = element_line(colour = "black"),
        axis.text.x = element_text(size = 15),
        axis.text.y = element_text(size = 15),
        panel.grid.major = element_line(colour = "grey90"),
        panel.grid.minor = element_line(colour = "grey90"),
        panel.border = element_blank(),
        panel.background = element_blank(),
        legend.position = "top",
        legend.key = element_rect(fill = "white"),
        legend.text=element_text(size=15))

我想知道,b$diagnosis 可以以某种方式用于破坏 x 轴吗? b$diagnosis对应具体的手术日期。

预期输出

数据

b <- structure(list(diagnosis = c("2017-10-19", "2017-07-11", "2020-06-30", 
"2020-06-27", "2017-01-04", "2017-12-07", "2017-09-18", "2020-07-27", 
"2020-08-28", "2020-12-29", "2018-04-12", "2020-06-20", "2020-08-29", 
"2018-02-05", "2018-01-12", "2017-07-15", "2018-03-07", "2020-02-29", 
"2019-08-24", "2017-08-08", "2018-11-27", "2017-03-15", "2017-05-12", 
"2020-10-22", "2019-08-31", "2017-11-17", "2019-04-17", "2018-11-15", 
"2018-02-08", "2019-08-09", "2019-10-06", "2017-08-30", "2019-05-09", 
"2017-06-05", "2017-10-04", "2018-01-27", "2017-06-16", "2019-03-29", 
"2017-06-16", "2018-07-19", "2020-04-23", "2020-01-31", "2020-06-27", 
"2019-12-11", "2019-08-13", "2017-05-07", "2020-05-08", "2020-09-05", 
"2019-12-18", "2018-07-24", "2017-07-31", "2017-01-23", "2018-09-08", 
"2018-12-18", "2017-08-01", "2019-04-11", "2017-05-12", "2019-03-15", 
"2019-06-12", "2017-05-10", "2020-10-27", "2018-08-26", "2019-06-03", 
"2020-07-31", "2017-12-02", "2018-11-07", "2018-03-23", "2019-08-18", 
"2019-08-30", "2018-07-23", "2018-08-08", "2018-10-10", "2019-05-26", 
"2017-11-18", "2020-07-19", "2017-02-07", "2017-08-15", "2020-01-05", 
"2019-07-28", "2017-05-28", "2017-01-02", "2018-09-25", "2017-03-26", 
"2017-04-24", "2018-03-26", "2020-12-01", "2018-09-27", "2019-09-26", 
"2017-10-06", "2019-01-11", "2020-08-15", "2017-02-06", "2018-06-07", 
"2018-03-15", "2017-12-17", "2017-02-08", "2019-11-02", "2020-12-05", 
"2017-09-16", "2017-06-18"), cons_week = c(42, 28, 183, 182, 
1, 49, 38, 187, 191, 209, 67, 181, 191, 58, 54, 28, 62, 165, 
138, 32, 100, 11, 19, 199, 139, 46, 120, 98, 58, 136, 144, 35, 
123, 23, 40, 56, 24, 117, 24, 81, 173, 161, 182, 154, 137, 18, 
175, 192, 155, 82, 31, 4, 88, 103, 31, 119, 19, 115, 128, 19, 
200, 86, 127, 187, 48, 97, 64, 137, 139, 82, 84, 93, 125, 46, 
185, 6, 33, 157, 134, 21, 1, 91, 12, 17, 65, 205, 91, 143, 40, 
106, 189, 6, 75, 63, 50, 6, 148, 205, 37, 24), corona = structure(c(2L, 
2L, 1L, 1L, 2L, 2L, 2L, 1L, 1L, 1L, 2L, 1L, 1L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 1L, 2L, 1L, 2L, 2L, 2L, 1L, 1L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 1L, 2L, 2L, 1L, 2L, 
2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 
2L, 2L, 2L, 2L, 1L, 2L, 2L, 2L, 2L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 
1L, 2L, 2L), .Label = c("C19", "Normal"), class = "factor")), row.names = c(NA, 
-100L), class = c("tbl_df", "tbl", "data.frame"))

【问题讨论】:

    标签: r date ggplot2 plot axis


    【解决方案1】:

    我建议将您的 cons_week 转换为日期,例如:

    lin.model <- b %>% 
      group_by(corona, cons_week) %>%
      summarise(n = n()) %>%
      mutate(cons_week_dt = as.Date("2017-01-02") + cons_week*7)
    

    然后:

    ggplot(lin.model,
           aes(x = cons_week_dt, y = n, color = corona, fill = corona)) +
           ...
           scale_x_date(date_breaks = "6 months", date_labels = "%b%Y", expand = c(0.07, 0)) +
           ...
    

    【讨论】:

    • 嘿 - 看起来很棒。我在复制您的示例时收到此错误 - 知道为什么吗? Invalid input: date_trans works with objects of class Date onlylin.model$cons_week_dtstr() 中看起来像as.Date
    • 您是否将aes 更新为引用cons_week_dt 而不是cons_week
    • 是的,我想通了——非常感谢!
    • 嗨 Jon - 简短的衍生问题:你知道为什么 geom_segment() 没有在 scale_x_date() 上打印吗?我想在2020-03-11(或者只是 2020 年 3 月,我输入:geom_segment(x = as.POSIXct(2017-01-02, origin = "2020-03-11"), xend = as.POSIXct(2017-01-02, origin = "2020-03-11"), y = 2, yend = 10, size=2, color = "red") + ...- 但是,什么也没发生,我没有收到任何错误。
    • 试试geom_segment(x = as.Date("2017-01-02"), xend = as.Date("2017-01-02"), y = 2, yend = 10, size=2, color = "red") +。我尝试将所有内容都保留为日期或日期时间,因为混合并不总是有效。
    猜你喜欢
    • 1970-01-01
    • 2018-03-16
    • 1970-01-01
    • 2021-12-04
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    • 1970-01-01
    相关资源
    最近更新 更多