【Posted】:2013-12-20 13:15:08
【Question】:
Given strings like the ones below, how can I write a regular expression that turns each string into a list of numbers?
value = '88-94'
value = '88 to 94'
value = '88'
value = '88-94, 96-108'
The results should be:
[88, 89, 90, 91, 92, 93, 94]
[88, 89, 90, 91, 92, 93, 94]
[88]
[88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108]
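The programming language is Python 2.7.
Here is a working solution for Python 2.7 using the regex module, but the last case, the one with a single value, has to be checked as a separate case: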
>>> import regex
>>> m = regex.match(r"(?:(?P<digits>\d+).(?P<digits>\d+))", "88-94")
>>> a = m.captures("digits")
>>> a
['88', '94']
>>> m = regex.match(r"(?:(?P<digits>\d+).(?P<digits>\d+))", "88 94")
>>> a = m.captures("digits")
>>> a
['88', '94']
>>> range(int(a[0]), int(a[1])+1)
[88, 89, 90, 91, 92, 93, 94]
>>>
Here is a solution that handles the cases above, but what about 88-94, 96-98 and so on?
>>> import re
>>> a = map(int, re.findall(r'\d+', '88-94'))
>>> range(a[0], a[-1]+1)
[88, 89, 90, 91, 92, 93, 94]
>>> a = map(int, re.findall(r'\d+', '88 94'))
>>> range(a[0], a[-1]+1)
[88, 89, 90, 91, 92, 93, 94]
>>> a = map(int, re.findall(r'\d+', '88'))
>>> range(a[0], a[-1]+1)
[88]
>>>
A solution that covers almost all cases:
>>> import re
>>> a = map(int, re.findall(r'\d+', '88-94, 96-108'))
>>> c = zip(a[::2], a[1::2])
>>> [m for k in [range(i,j+1) for i, j in c] for m in k]
[88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108]
>>> a = map(int, re.findall(r'\d+', '88-94, 96-108, 125 129'))
>>> c = zip(a[::2], a[1::2])
>>> [m for k in [range(i,j+1) for i, j in c] for m in k]
[88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 125, 126, 127, 128, 129]
>>> a = map(int, re.findall(r'\d+', '88-94, 96-108, 125 129, 132 to 136'))
>>> c = zip(a[::2], a[1::2])
>>> [m for k in [range(i,j+1) for i, j in c] for m in k]
[88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 125, 126, 127, 128, 129, 132, 133, 134, 135, 136]
>>>
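Note that the zip(a[::2], a[1::2]) pairing above silently drops a lone value such as '88', because there is no second number to pair it with. A minimal sketch of one way to cover that case with a single pattern is shown below. It is not from the original post: the names RANGE_RE and expand_ranges are hypothetical, and it assumes the only separators inside a range are "-", "to", or plain whitespace, as in the examples above.

```python
import re

# One number, optionally followed by a separator ("-", "to", or whitespace)
# and a second number. A missing second number means a single value.
# RANGE_RE and expand_ranges are illustrative names, not part of the question.
RANGE_RE = re.compile(r'(\d+)(?:(?:\s*-\s*|\s+to\s+|\s+)(\d+))?')

def expand_ranges(value):
    """Expand a string such as '88-94, 96-108' into a flat list of ints."""
    result = []
    for m in RANGE_RE.finditer(value):
        start = int(m.group(1))
        # Fall back to the start value when no end of range was matched.
        stop = int(m.group(2)) if m.group(2) is not None else start
        result.extend(range(start, stop + 1))
    return result
```

The expansion itself still needs a loop. The single regular expression only finds the (start, stop) pairs, so this does not make the whole task a one-statement job.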
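Can anyone give a reason for the downvote or the vote to close?
Any help would be appreciated. Can anyone suggest how to improve the question? I am not asking for alternative solutions, because I know how to split and loop, or even how to strip out the numbers with re and loop. My question is how to do this with a single re, if that is possible. The answer may be "no", but the question is not off-topic.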
【Comments】:
-
start, stop = map(int, re.findall(r'\d+', re.sub(r'^(\d+)$', r'\1-\1', value))); print range(start, stop+1) (2 statements)
Why do you want to do this as a single statement? Do it in a way that is clear, readable, and makes sense to you. Explicit is better than implicit; and simple is better than complex.
-
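The normalization idea from the first comment can be sketched as a small function. This is a hedged illustration rather than the commenter's exact code: the name expand_single_range is hypothetical, and it assumes the input holds exactly one range or one bare number with no surrounding whitespace.

```python
import re

def expand_single_range(value):
    """Expand one range such as '88-94', '88 to 94' or '88' into a list."""
    # Rewrite a bare number like '88' as '88-88' so that it always
    # yields two numbers in the next step.
    normalized = re.sub(r'^(\d+)$', r'\1-\1', value)
    # Assumes exactly two numbers are found; anything else raises ValueError.
    start, stop = map(int, re.findall(r'\d+', normalized))
    return list(range(start, stop + 1))
```

Because the substitution is anchored with ^ and $, it only handles a string that is a single number. An input with several comma-separated ranges would need a different approach.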