年份未正确显示正则表达式

时间:2015-05-12 08:57:54

标签: c# regex

使用正则表达式来匹配这一行文本,我发现这一年仍然显示为2015-01-07,而不仅仅是2015。任何人都可以看到我的正则表达式有什么问题吗?

代码行:

    2015-01-07 Wed Jan 07 11:03:43.390 DD Started

我的正则表达式:

    (?<date>(?<year>(?:\d{4}|\d{2})-(?<month>\d{1,2})-(?<day>\d{1,2})))\s(?<logEntry1>.*)\s(?<logEntry2>.*)\s(?<logEntry3>.*)\s(?<time>(?<hour>\d{2}):(?<minutes>\d{2}):(?<seconds>\d{2}).(?<milli>\d{0,3}))\s(?<logEntry>.*)

为什么我不能单独列出&#39;年份? 我通过regex101.com运行它,这里是捕获组值:

    MATCH 1
    date    [0-10]  `2015-01-07`
    year    [0-10]  `2015-01-07`
    month   [5-7]   `01`
    day [8-10]  `07`
    logEntry1   [11-14] `Wed`
    logEntry2   [15-18] `Jan`
    logEntry3   [19-21] `07`
    time    [22-34] `11:03:43.390`
    hour    [22-24] `11`
    minutes [25-27] `03`
    seconds [28-30] `43`
    milli   [31-34] `390`
    logEntry    [35-45] `DD Started`

2 个答案:

答案 0 :(得分:2)

(?<date>(?<year>(?:\d{4}|\d{2}))-(?<month>\d{1,2})-(?<day>\d{1,2}))\s(?<logEntry1>.*?)\s(?<logEntry2>.*?)\s(?<logEntry3>.*?)\s(?<time>(?<hour>\d{2}):(?<minutes>\d{2}):(?<seconds>\d{2}).(?<milli>\d{0,3}))\s(?<logEntry>.*)

                              ^^

试试这个。看看演示。你在一年之后错过了)

https://regex101.com/r/oF9hR9/16

答案 1 :(得分:2)

您应该将括号从)))移到(?:\d{4}|\d{2}))

(?<date>(?<year>(?:\d{4}|\d{2}))-(?<month>\d{1,2})-(?<day>\d{1,2}))\s(?<logEntry1>.*)\s(?<logEntry2>.*)\s(?<logEntry3>.*)\s(?<time>(?<hour>\d{2}):(?<minutes>\d{2}):(?<seconds>\d{2}).(?<milli>\d{0,3}))\s(?<logEntry>.*)

实际上,我宁愿使用带有如此长的正则表达式的 VERBOSE 正则表达式选项,并使用# Year之类的注释来跟踪你的内容:

var rx = new Regex(@"(?<date>
     (?<year>
       (?:\d{4}|\d{2})
     )                  # Year
     -
     (?<month>\d{1,2})
     -
     (?<day>\d{1,2})
   )                    # Date
   \s
   (?<logEntry1>.*)
   \s
   (?<logEntry2>.*)
   \s
   (?<logEntry3>.*)
   \s
   (?<time>
     (?<hour>\d{2})
     :
     (?<minutes>\d{2})
     :
     (?<seconds>\d{2})
     .
     (?<milli>\d{0,3})
   )
   \s
   (?<logEntry>.*)", RegexOptions.IgnorePatternWhitespace);

结果:

enter image description here