Question

我写了以下正则表达式

(1[012]|[1-9])(am|pm)\-(1[012]|[1-9])(am|pm)

匹配以下类型的时间格式：

7am-10pm (matches correctly and creates 4 match groups 7, am, 10, pm)

13am-10pm (this should not be matched, however it matches and creates 4 match groups 3, am, 10, pm)

10pm (this doesn't match as expected because it doesn't specify the time range end)

111am-10pm (this should not be matched, however it matches and creates 4 match groups 11, am, 10, pm)

如何改进我的正则表达式，以便我不需要重复数字和上午/下午模式以及以下内容：

它只捕获时间范围组件，例如早上7点到10点，应该只有2个匹配组，早上7点，上午10点。
它仅匹配适当的小时数，例如上午111点或下午13点等应视为不匹配。
我不知道是否可以使用正则表达式，但我们是否可以使正则表达式匹配正确的时间范围，例如上午7点到下午1点应该匹配，但是下午4点到下午1点应该被视为不匹配？

注意：我使用的是Ruby 2.2.1

感谢。

Answer 1

你的正则表达式中缺少^（行的开头），这就是它们之间匹配的原因。

你必须使用：

^(1[012]|[1-9])(am|pm)\-(1[012]|[1-9])(am|pm)

更好的解决方案：如果您的模式并非始终从新行开始，您也可以使用\b（边界）。

\b(1[012]|[1-9])(am|pm)\-(1[012]|[1-9])(am|pm)\b

请参阅DEMO。

Answer 2

首先，让我们看看你做错了什么：

13 am-10pm（这不应该匹配，但它匹配并创建4个匹配组3，am，10，pm）

它仅匹配适当的小时数，例如上午111点或下午13点等应视为不匹配。

匹配，因为你允许在这里匹配一个数字[1-9] :( 1 [012] | [1-9]）。

为了解决此问题，您应该允许一个[1-9]数字或1 + [0-2]。由于我们不知道正则表达式何时开始，我们将使用一些单词边界来确保我们有一个＆＃34;单词start＆＃34;。

由于您不想捕获数字，但整个时间加上am | pm，您可以使用非捕获组：

\b((?:1[0-2]|[1-9])

然后，这只是重复我们自己并加上冲刺的问题：

\b((?:1[0-2]|[1-9])[ap]m)-((?:1[0-2]|[1-9])[ap]m)

关于第3点。嗯，是的，可以使用正则表达式执行此操作，但是一旦获得第1组和第2组，只需添加逻辑检查以查看时间范围是否真的更好有道理。

总而言之，这就是你得到的：

# \b((?:1[0-2]|[1-9])[ap]m)-((?:1[0-2]|[1-9])[ap]m)
# 
# 
# Assert position at a word boundary «\b»
# Match the regular expression below and capture its match into backreference number 1 «((?:1[0-2]|[1-9])[ap]m)»
#    Match the regular expression below «(?:1[0-2]|[1-9])»
#       Match either the regular expression below (attempting the next alternative only if this one fails) «1[0-2]»
#          Match the character “1” literally «1»
#          Match a single character in the range between “0” and “2” «[0-2]»
#       Or match regular expression number 2 below (the entire group fails if this one fails to match) «[1-9]»
#          Match a single character in the range between “1” and “9” «[1-9]»
#    Match a single character present in the list “ap” «[ap]»
#    Match the character “m” literally «m»
# Match the character “-” literally «-»
# Match the regular expression below and capture its match into backreference number 2 «((?:1[0-2]|[1-9])[ap]m)»
#    Match the regular expression below «(?:1[0-2]|[1-9])»
#       Match either the regular expression below (attempting the next alternative only if this one fails) «1[0-2]»
#          Match the character “1” literally «1»
#          Match a single character in the range between “0” and “2” «[0-2]»
#       Or match regular expression number 2 below (the entire group fails if this one fails to match) «[1-9]»
#          Match a single character in the range between “1” and “9” «[1-9]»
#    Match a single character present in the list “ap” «[ap]»
#    Match the character “m” literally «m»

正则表达式匹配上午/下午7点至晚上10点的时间范围

2 个答案: