Extracting text between two markers with a regular expression

Time: 2015-03-14 15:17:25

Tags: ios regex nsregularexpression

A simple regex question. I have a string in the following format:

[page]
Some text with multi line.
[page/]

[page]
Another text with multi line.
[page/]

[page]
Third text with multi line.
[page/]

What regular expression extracts the text between [page] and [page/]?

I am using this code, but I only get the first match.

NSString *path = [[NSBundle mainBundle] pathForResource:@"File" ofType:@"txt"];
NSString *mainText = [NSString stringWithContentsOfFile:path encoding:NSUTF8StringEncoding error:nil];

NSError *error = nil;
NSRange range = NSMakeRange(0, mainText.length);

NSString *pattern = @"(?<=\\[page])(?s)(.*?)(?=\\[page/])";
NSRegularExpression *regex = [NSRegularExpression regularExpressionWithPattern:pattern options:NSRegularExpressionCaseInsensitive error:&error];
NSRange rangeOfFirstMatch = [regex rangeOfFirstMatchInString:mainText options:0 range:range];

if (!NSEqualRanges(rangeOfFirstMatch, NSMakeRange(NSNotFound, 0))) {
    NSString *substringForFirstMatch = [mainText substringWithRange:rangeOfFirstMatch];
    NSLog(@"sub: %@", substringForFirstMatch);
}

How can I get the text of every match in an NSArray?

1 answer:

Answer 0: (score: 2)

You can use matchesInString:options:range:, which returns an array of the matches as NSTextCheckingResult objects:

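    NSError *error = nil;
    // The lookbehind and lookahead keep the [page] and [page/] markers out of the match.
    NSString *pattern = @"(?<=\\[page\\])(.*?)(?=\\[page\\/\\])";
    // DotMatchesLineSeparators lets "." span newlines, so multi-line blocks match.
    NSRegularExpressionOptions options = NSRegularExpressionCaseInsensitive | NSRegularExpressionDotMatchesLineSeparators;
    NSRegularExpression *regex = [NSRegularExpression regularExpressionWithPattern:pattern options:options error:&error];

    // mainText is the string loaded in the question.
    for (NSTextCheckingResult *result in [regex matchesInString:mainText
                                                        options:0
                                                          range:NSMakeRange(0, mainText.length)])
    {
        // result.range covers only the text between the two markers
        NSString *pageText = [mainText substringWithRange:result.range];
        // further code
    }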
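
To end up with the NSArray the question asks for, the loop can append each matched substring to a mutable array. The following is a minimal sketch, not part of the original answer: the helper name PageTexts is hypothetical, and trimming the surrounding whitespace and newlines is an assumption about what the caller wants (the raw match includes the line breaks right after [page] and right before [page/]).

```objc
#import <Foundation/Foundation.h>

// Hypothetical helper (name not from the original post): returns the text of
// every [page]...[page/] block, trimmed of surrounding whitespace and newlines.
static NSArray<NSString *> *PageTexts(NSString *text) {
    NSError *error = nil;
    NSRegularExpressionOptions options =
        NSRegularExpressionCaseInsensitive | NSRegularExpressionDotMatchesLineSeparators;
    NSRegularExpression *regex =
        [NSRegularExpression regularExpressionWithPattern:@"(?<=\\[page\\])(.*?)(?=\\[page/\\])"
                                                  options:options
                                                    error:&error];
    if (regex == nil) {
        // Pattern failed to compile; report it and return an empty array.
        NSLog(@"Invalid pattern: %@", error);
        return @[];
    }

    NSMutableArray<NSString *> *pages = [NSMutableArray array];
    NSCharacterSet *trimSet = [NSCharacterSet whitespaceAndNewlineCharacterSet];
    NSRange fullRange = NSMakeRange(0, text.length);

    for (NSTextCheckingResult *result in [regex matchesInString:text options:0 range:fullRange]) {
        // With lookarounds, result.range is exactly the text between the markers.
        NSString *body = [text substringWithRange:result.range];
        // Assumption: callers want the text without the leading/trailing line breaks.
        [pages addObject:[body stringByTrimmingCharactersInSet:trimSet]];
    }
    return [pages copy];
}
```

Calling PageTexts(mainText) on the sample input from the question should give an array of three strings, one per [page] block. If the original line breaks matter, drop the trimming step and add body directly.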