【发布时间】:2020-12-20 04:36:01
【问题描述】:
我正在尝试使用正则表达式将数据框中的日期与 500 个条目匹配:
日期可以以下列格式显示:
04/20/2009; 04/20/09; 4/20/09; 4/3/09
Mar-20-2009; Mar 20, 2009; March 20, 2009; Mar. 20, 2009; Mar 20 2009;
20 Mar 2009; 20 March 2009; 20 Mar. 2009; 20 March, 2009
Mar 20th, 2009; Mar 21st, 2009; Mar 22nd, 2009
Feb 2009; Sep 2009; Oct 2010
6/2008; 12/2009
2009; 2010
dates[dates[0].str.contains(r'(?P<year>\d?\d?\d\d)')].shape
返回一个shape(500,1)的元组
但是
dates[dates[0].str.contains(r'((?P\<day\>(\d?\d)?(\s|-|/|th|st|nd)?)??P\<year\>(\d?\d?\d\d))')].shape
返回一个shape(0,1)的元组,但是天组是可选的,所以它不应该仍然匹配年份组。
【问题讨论】: