使用正则表达式来匹配这一行文本,我发现这一年仍然显示为2015-01-07
,而不仅仅是2015
。任何人都可以看到我的正则表达式有什么问题吗?
代码行:
2015-01-07 Wed Jan 07 11:03:43.390 DD Started
我的正则表达式:
(?<date>(?<year>(?:\d{4}|\d{2})-(?<month>\d{1,2})-(?<day>\d{1,2})))\s(?<logEntry1>.*)\s(?<logEntry2>.*)\s(?<logEntry3>.*)\s(?<time>(?<hour>\d{2}):(?<minutes>\d{2}):(?<seconds>\d{2}).(?<milli>\d{0,3}))\s(?<logEntry>.*)
为什么我不能单独列出&#39;年份? 我通过regex101.com运行它,这里是捕获组值:
MATCH 1
date [0-10] `2015-01-07`
year [0-10] `2015-01-07`
month [5-7] `01`
day [8-10] `07`
logEntry1 [11-14] `Wed`
logEntry2 [15-18] `Jan`
logEntry3 [19-21] `07`
time [22-34] `11:03:43.390`
hour [22-24] `11`
minutes [25-27] `03`
seconds [28-30] `43`
milli [31-34] `390`
logEntry [35-45] `DD Started`
答案 0 :(得分:2)
(?<date>(?<year>(?:\d{4}|\d{2}))-(?<month>\d{1,2})-(?<day>\d{1,2}))\s(?<logEntry1>.*?)\s(?<logEntry2>.*?)\s(?<logEntry3>.*?)\s(?<time>(?<hour>\d{2}):(?<minutes>\d{2}):(?<seconds>\d{2}).(?<milli>\d{0,3}))\s(?<logEntry>.*)
^^
试试这个。看看演示。你在一年之后错过了)
。
答案 1 :(得分:2)
您应该将括号从)))
移到(?:\d{4}|\d{2}))
:
(?<date>(?<year>(?:\d{4}|\d{2}))-(?<month>\d{1,2})-(?<day>\d{1,2}))\s(?<logEntry1>.*)\s(?<logEntry2>.*)\s(?<logEntry3>.*)\s(?<time>(?<hour>\d{2}):(?<minutes>\d{2}):(?<seconds>\d{2}).(?<milli>\d{0,3}))\s(?<logEntry>.*)
实际上,我宁愿使用带有如此长的正则表达式的 VERBOSE 正则表达式选项,并使用# Year
之类的注释来跟踪你的内容:
var rx = new Regex(@"(?<date>
(?<year>
(?:\d{4}|\d{2})
) # Year
-
(?<month>\d{1,2})
-
(?<day>\d{1,2})
) # Date
\s
(?<logEntry1>.*)
\s
(?<logEntry2>.*)
\s
(?<logEntry3>.*)
\s
(?<time>
(?<hour>\d{2})
:
(?<minutes>\d{2})
:
(?<seconds>\d{2})
.
(?<milli>\d{0,3})
)
\s
(?<logEntry>.*)", RegexOptions.IgnorePatternWhitespace);
结果: