字符串解析,用逗号分隔,除非它被撇号括起来

时间:2012-10-15 20:06:12

标签: objective-c ios cocoa-touch parsing

我需要在Objective-C for iOS app

中解析以下字符串

NSString * htmlString = @“12,22,'stringA','','stringB,stringC',2,'stringD'”;

我想要像这样的数组

{
    @12,
    @22,
    @"stringA",
    @"emptySlotInfo",
    @"stringB, stringC",
    @2,
    @"stringD"
}

头痛是@“strinb,stringC”,因为

[htmlString componentsSeparatedByString:@","];

不适用于案例,@“'”因为分隔符也不起作用。

如何获得必要的组件?

1 个答案:

答案 0 :(得分:2)

您可以使用NSScanner

如果它扫描',它知道字符串正在启动并忽略,,直到它读取下一个'。如果没有读取开放',请按,进行分配。

This cocoawithlove article可能会有所帮助。


我制作了一个快速原型。最有可能有很多优化,因为我也不是NSScanner的专家

NSString *htmlString = @"12, 22, 'stringA','', 'stringB, stringC', 2,'stringD'";
NSScanner *scanner = [NSScanner scannerWithString:htmlString];

NSString *apostrophe = @"'";    // scanner needs to detect this
NSString *comma = @",";         // scanner needs to detect this
NSCharacterSet *charSet = [NSCharacterSet characterSetWithCharactersInString:[NSString stringWithFormat:@"%@%@", apostrophe, comma]];
BOOL apostropheOpen = NO;       // is the scan location inside a single quoted substring?
NSInteger lastCommaIndex = -1;  // track last found comma's index
NSMutableArray *array = [NSMutableArray array];

while (![scanner isAtEnd]) {
    [scanner scanUpToCharactersFromSet:charSet intoString:NULL];
    NSString *charAtlocation = [htmlString substringWithRange:NSMakeRange([scanner scanLocation], 1)];
    if ([charAtlocation isEqualToString:apostrophe]){
        apostropheOpen = !apostropheOpen;                
    } else if ([charAtlocation isEqualToString:comma]){
        if (!apostropheOpen) {
            [array addObject: [scanner.string substringWithRange:NSMakeRange(lastCommaIndex+1, [scanner scanLocation]- lastCommaIndex-1)]];
            lastCommaIndex = [scanner scanLocation];
        }
    }
    [scanner setScanLocation:[scanner scanLocation]+1];
} ;

// scanner only dealt with the string until the last comma, probably one more value to handle
if (lastCommaIndex < [scanner scanLocation]){
    [array addObject: [scanner.string substringWithRange:NSMakeRange(lastCommaIndex+1, [scanner scanLocation]- lastCommaIndex-1)]];
}

// array contains seperated strings, but with blanks and apostrophes
// we will deal with them now
__block NSMutableArray *resultArray = [NSMutableArray array];
[array enumerateObjectsUsingBlock:^(NSString *obj, NSUInteger idx, BOOL *stop) {
    obj = [[obj stringByTrimmingCharactersInSet:[NSCharacterSet whitespaceCharacterSet]]
                stringByTrimmingCharactersInSet:charSet];
    if ([obj length] > 0)
        [resultArray addObject:obj];
    else
        [resultArray addObject:@"emptySlotInfo"];
}];

resultArray包含

(
12,
22,
stringA,
emptySlotInfo,
stringB, stringC,
2,
stringD
)