解析xml到tbxml后删除特殊字符

时间:2013-08-01 21:35:35

标签: html ios objective-c xml cocoa-touch

我在解析XML内容并在ui中显示时遇到问题。 XML具有类似于

的元素的内容
<fullDescription>3.2GHz PowerPC CPU  ATI GPU  512 MB 700 MHz GDDR3 RAM  1x Wireless Game Controller  3x USB 2.0 port  XBOX Live ready  20GB HDD  HD-AV-Kabel für High-Definition Output (720p, 1080i)  inkl.</fullDescription>

然而,在我解析它(通过TBXML)后,我得到字符串

3.2GHz PowerPC CPU  ATI GPU  512 MB 700 MHz GDDR3 RAM  1x Wireless Game Controller  3x USB 2.0 port  XBOX Live ready  20GB HDD  HD-AV-Kabel für High-Definition Output (720p, 1080i)  inkl.

我已经尝试了许多已经提到的解决方案来清理特殊字符,例如 HTML character decoding in Objective-C / Cocoa Touch 甚至修改了方法以包含“”,双重空间似乎没有任何作用..

我无法使用Github NSString category for HTML,因为该代码似乎与ARC不兼容,当我尝试在我的项目中使用它时,我遇到了各种各样的错误。

有人可以帮助我朝正确的方向努力吗...在这一段时间里把我的头发拉出来:-(我想我必须有一种简单的方法以通用的方式做到这一点。

2 个答案:

答案 0 :(得分:0)

只需检查xml文件的编码与xml标题中指示的相同。

答案 1 :(得分:0)

你试过这个吗?

//  NSString_stripHtml.h
//  Copyright 2011 Leigh McCulloch. Released under the MIT license.

#import <Foundation/Foundation.h>

@interface NSString (stripHtml)
- (NSString*)stripHtml;
@end

//  NSString_stripHtml.m
//  Copyright 2011 Leigh McCulloch. Released under the MIT license.

#import "NSString_stripHtml.h"

@interface NSString_stripHtml_XMLParsee : NSObject<NSXMLParserDelegate> {
@private
    NSMutableArray* strings;
}
- (NSString*)getCharsFound;
@end

@implementation NSString_stripHtml_XMLParsee
- (id)init {
    if((self = [super init])) {
        strings = [[NSMutableArray alloc] init];
    }
    return self;
}

- (void)parser:(NSXMLParser*)parser foundCharacters:(NSString*)string {
    [strings addObject:string];
}
- (NSString*)getCharsFound {
    return [strings componentsJoinedByString:@""];
}
@end

@implementation NSString (stripHtml)
- (NSString*)stripHtml {
    // take this string obj and wrap it in a root element to ensure only a single root element exists
    NSString* string = [NSString stringWithFormat:@"<root>%@</root>", self];

    // add the string to the xml parser
    NSStringEncoding encoding = string.fastestEncoding;
    NSData* data = [string dataUsingEncoding:encoding];
    NSXMLParser* parser = [[NSXMLParser alloc] initWithData:data];

    // parse the content keeping track of any chars found outside tags (this will be the stripped content)
    NSString_stripHtml_XMLParsee* parsee = [[NSString_stripHtml_XMLParsee alloc] init];
    parser.delegate = parsee;
    [parser parse];

    // log any errors encountered while parsing
    //NSError * error = nil;
    //if((error = [parser parserError])) {
    //    NSLog(@"This is a warning only. There was an error parsing the string to strip HTML. This error may be because the string did not contain valid XML, however the result will likely have been decoded correctly anyway.: %@", error);
    //}

    // any chars found while parsing are the stripped content
    NSString* strippedString = [parsee getCharsFound];

    // get the raw text out of the parsee after parsing, and return it
    return strippedString;
}
@end