使用RegEx从字符串中提取多个值

时间:2011-01-08 17:43:13

标签: c# .net regex

我有一个输入字符串,如下例所示:

string.Format("Document {0}, was saved by {1} on {2}. The process was completed 
{3} milliseconds and data was received.", 
"Document.docx", "John", "1/1/2011", 45);

它生成的字符串如下:

Document Document.docx, was saved by John on 1/1/2011. The process was completed 
45 milliseconds and data was received.

从其他应用程序收到此类字符串后,最简单的方法是使用正则表达式进行解析并提取值Document.docxJohn1/1/2011,{{1}来自它。

我正在寻找最简单的方法,因为我们必须解析许多不同的输入字符串。

1 个答案:

答案 0 :(得分:8)

您可以使用以下内容:

private static readonly Regex pattern =
    new Regex("^Document (?<document>.*?), was saved by (?<user>.*?) on " +
        "(?<date>.*?)\\. The process was completed (?<duration>.*?) " +
        "milliseconds and data was received\\.$");

public static bool MatchPattern(
    string input,
    out string document,
    out string user,
    out string date,
    out string duration)
{
    document = user = date = duration = null;

    var m = pattern.Match(input);
    if (!m.Success)
        return false;

    document = m.Groups["document"].Value;
    user = m.Groups["user"].Value;
    date = m.Groups["date"].Value;
    duration = m.Groups["duration"].Value;

    return true;
}

返回包含所需信息的复合类型而不是使用out参数可能值得重构。但这种方法仍然有效。

要使用此代码,您可以执行以下操作:

var input = "...";

string document, user, date, duration;
if (MatchPattern(input, out document, out user, out date, out duration)) {
    // Match was successful.
}