【Question title】: Character format cannot be changed to Date format
【Posted】: 2017-12-26 22:09:43
【Question】:

I have a dataset with a column named Date, which I read into R.
Here is the data:

unique(data$Date)
 [1] ""           "2016/12/20" "2016/12/27" "2017/1/7"   "2017/1/27"  "2017/2/1"   "2017/2/2"   "2017/2/5"   "2017/2/6"   "2017/2/7"  
[11] "2017/2/8"   "2017/2/10"  "2017/2/11"  "2017/2/13"  "2017/2/14"  "2017/2/15"  "2017/2/17"  "2017/2/16"  "2017/2/24"  "2017/2/19" 
[21] "2017/2/21"  "2017/2/20"  "2017/2/26"  "2017/2/22"  "2017/3/2"   "2017/2/25"  "2017/2/28"  "2017/3/1"   "2017/3/4"   "2017/3/5"  
[31] "2017/3/6"   "2017/3/10"  "2017/3/8"   "2017/3/9"   "2017/3/11"  "2017/3/12"  "2017/3/13"  "2017/3/15"  "2017/3/29"  "2017/5/13" 
[41] "2015/10/5"  "2016/2/22"  "2015/3/6"   "2015/3/7"   "2015/10/15" "2015/3/9"   "2016/1/30"  "2015/10/29" "2015/10/24" "2015/10/17"
[51] "2016/1/8"   "2015/9/24"  "2016/2/15"  "2015/12/8"  "2015/12/10" "2016/2/6"   "2015/11/29" "2016/1/23"  "2015/10/11" "2016/2/16" 
[61] "2015/9/28"  "2016/1/29"  "2015/11/27" "2015/10/12" "2015/11/1"  "2015/11/16" "2015/10/10" "2015/11/30" "2016/1/2"   "2016/1/21" 
[71] "2016/4/22"  "2015/10/21" "2015/11/12" "2015/12/28" "2015/12/30" "2015/11/6"  "2015/10/8"  "2015/12/6"  "2016/1/24"  "2016/1/17" 
[81] "2016/2/26"  "2016/3/6"   "2016/2/17"  "2016/1/11"  "2015/12/3"  "2016/2/11"  "2015/11/22" "2015/10/2"  "2015/10/3"  "2015/11/4" 
[91] "2016/2/10"  "2015/12/9"  "2015/10/9"  "2015/12/1"  "2016/2/25"  "2016/1/19"  "2016/1/18"  "2015/12/13" "2016/2/14"  "2016/3/10" 

class(data$Date)
[1] "character"

I tried to convert this character column to Date with as.Date():

data$Date <- as.Date(data$Date)
Error in charToDate(x) : 
character string is not in a standard unambiguous format

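I can't figure out what is wrong. I suspect the problem is the "" in the data. I have another column named Date2, and that column contains no "".
Any suggestions?

Also, if I want to run as.Date on both columns at once with an explicit format such as as.Date(x, "%Y/%m/%d"), how do I do that?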

data[,c("Date", "Date2") := lapply(.SD, as.Date), .SDcols = c("Date", "Date2")]

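【Comments】:

  • Specify the date format: as.Date(x, "%Y/%m/%d")
  • Maybe I could change "" to NA first and then convert to Date? Is that possible?
  • So as.Date checks the first element to "guess" the format and then applies it to every subsequent element? Because as.Date(c("2015/10/5", "")) works fine. But if the first element is "", it has a problem?
  • @SymbolixAU - if you type as.Date.character in the console and read the code, you can see this check. It seems to do what you guessed.
  • For the updated question, data[, `:=`(Date = as.Date(Date), Date2 = as.Date(Date2))] should do it.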

Tags: r date as.date


【Solution 1】:

As the comments point out, you need to specify the format of the dates you are converting. In your case it is "%Y/%m/%d":

data$Date <- as.Date(data$Date, "%Y/%m/%d")
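For the two-column part of the question, the same format argument can be passed through lapply. The following is a minimal base-R sketch; the small data frame is a made-up stand-in for the poster's data, with only the column names Date and Date2 taken from the question.

```r
# Hypothetical sample data; only the column names come from the question.
data <- data.frame(
  Date  = c("", "2016/12/20", "2017/1/7"),
  Date2 = c("2015/10/5", "2016/2/22", "2015/3/6"),
  stringsAsFactors = FALSE
)

cols <- c("Date", "Date2")

# Extra arguments to lapply are forwarded to as.Date, so every column
# is parsed with the same explicit format; "" becomes NA.
data[cols] <- lapply(data[cols], as.Date, format = "%Y/%m/%d")

str(data)
```

If data is a data.table, as the := in the question suggests, the same idea should carry over as data[, (cols) := lapply(.SD, as.Date, format = "%Y/%m/%d"), .SDcols = cols].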

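Explanation

You need to do this because the first entry in your vector is "" and you have not specified a format.

When as.Date is applied to a character vector, it first checks whether the format argument is missing. If it is, it tries to guess the format from the first non-NA element of the vector.

It does so by testing the "%Y-%m-%d" and "%Y/%m/%d" formats: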

xx <- ""
strptime(xx, "%Y-%m-%d")
# [1] NA
strptime(xx, "%Y/%m/%d")
# [1] NA

More specifically, it uses the following test (where xx is the first non-NA element of the vector):

if(is.na(xx) || 
    !is.na(strptime(xx, f <- "%Y-%m-%d", tz = "GMT")) || 
    !is.na(strptime(xx, f <- "%Y/%m/%d", tz = "GMT"))){
  print("success!")     ## I added this print statement for illustration purposes
}else{
  stop("character string is not in a standard unambiguous format")
}

As you can see, when xx is "", every condition in the if evaluates to FALSE, so the function falls through to the stop() call.
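The check can be reproduced outside as.Date to see which strings pass. This is only an illustrative sketch: guess_format is a hypothetical helper, not part of base R, and it simply tries the two formats named above against a single string.

```r
# Hypothetical helper for illustration; not a base R function.
# Returns the first of the two candidate formats that parses xx,
# or NA when neither does (the case where as.Date would give up).
guess_format <- function(xx) {
  for (f in c("%Y-%m-%d", "%Y/%m/%d")) {
    if (!is.na(strptime(xx, f, tz = "GMT"))) return(f)
  }
  NA_character_
}

guess_format("2015/10/5")   # "%Y/%m/%d"
guess_format("2015-10-05")  # "%Y-%m-%d"
guess_format("")            # NA: an empty string matches neither format
guess_format("2015-10/5")   # NA: mixed separators match neither format
```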

To see this in action, look at the results of these statements:

as.Date(c("2015/10/5", ""))  
# [1] "2015-10-05" NA
## SUCCESS, because it can 'guess' the first entry's format

as.Date(c("", "2015/10/5"))  
## ERROR: can't 'guess' the first entry's format

as.Date(c("2015/10/5", ""), format = "%Y/%m/%d") 
# [1] "2015-10-05" NA
## SUCCESS, because you've specified the format

as.Date(c("2015-10/5", "")) 
## ERROR: you haven't specified the format, 
## AND it's not one of the 'guessed' options ("%Y-%m-%d", "%Y/%m/%d")
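One of the comments asks whether "" can be turned into NA first. A minimal sketch of that approach follows, using a short made-up vector in the same format as the question's data. Because the guess skips leading NA values, it works even without a format, although passing the format explicitly is still the safer habit.

```r
# Hypothetical sample values in the same format as the question's data.
x <- c("", "2016/12/20", "2017/1/7", "2015/10/5")

# Replace empty strings with NA before converting.
x[!nzchar(x)] <- NA

# With NA in first position, the format is guessed from the
# first non-NA element instead.
d_guessed  <- as.Date(x)
d_explicit <- as.Date(x, format = "%Y/%m/%d")

d_explicit
# [1] NA           "2016-12-20" "2017-01-07" "2015-10-05"

# Entries that were non-missing but failed to parse would show up here.
unparsed <- !is.na(x) & is.na(d_explicit)
any(unparsed)
# [1] FALSE
```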


【Solution 2】:

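You can use the ymd() function from the lubridate package to convert the dates; empty strings are converted to NA. For example (assuming dates holds the character vector shown in the question):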

    > library(lubridate)
    > (newdates <- ymd(dates))
       [1] NA           "2016-12-20" "2016-12-27" "2017-01-07" "2017-01-27" "2017-02-01" "2017-02-02" "2017-02-05"
       [9] "2017-02-06" "2017-02-07" "2017-02-08" "2017-02-10" "2017-02-11" "2017-02-13" "2017-02-14" "2017-02-15"
       [17] "2017-02-17" "2017-02-16" "2017-02-24" "2017-02-19" "2017-02-21" "2017-02-20" "2017-02-26" "2017-02-22"
       [25] "2017-03-02" "2017-02-25" "2017-02-28" "2017-03-01" "2017-03-04" "2017-03-05" "2017-03-06" "2017-03-10"
       [33] "2017-03-08" "2017-03-09" "2017-03-11" "2017-03-12" "2017-03-13" "2017-03-15" "2017-03-29" "2017-05-13"
       [41] "2015-10-05" "2016-02-22" "2015-03-06" "2015-03-07" "2015-10-15" "2015-03-09" "2016-01-30" "2015-10-29"
       [49] "2015-10-24" "2015-10-17" "2016-01-08" "2015-09-24" "2016-02-15" "2015-12-08" "2015-12-10" "2016-02-06"
       [57] "2015-11-29" "2016-01-23" "2015-10-11" "2016-02-16" "2015-09-28" "2016-01-29" "2015-11-27" "2015-10-12"
       [65] "2015-11-01" "2015-11-16" "2015-10-10" "2015-11-30" "2016-01-02" "2016-01-21" "2016-04-22" "2015-10-21"
       [73] "2015-11-12" "2015-12-28" "2015-12-30" "2015-11-06" "2015-10-08" "2015-12-06" "2016-01-24" "2016-01-17"
       [81] "2016-02-26" "2016-03-06" "2016-02-17" "2016-01-11" "2015-12-03" "2016-02-11" "2015-11-22" "2015-10-02"
       [89] "2015-10-03" "2015-11-04" "2016-02-10" "2015-12-09" "2015-10-09" "2015-12-01" "2016-02-25" "2016-01-19"
       [97] "2016-01-18" "2015-12-13" "2016-02-14" "2016-03-10"
    > is.Date(newdates)
     [1] TRUE    
    
