我有一些XML格式的RSS源,我在Android应用中收到了这些RSS源。这个想法很简单,它只是接收RSS更新并显示它们。我管理了一切,除了显示一些正确的话。问题是数据是用其他语言编写的,而我的应用程序是用英语编写的(我对这些东西知之甚少)。示例:
09:05KAMIONDŽIJE
20:05DočeknašiholimpijacaispredSkupštinegrada
等等......你看到那些Ž,č,š字母..它们显示为其他一些未知的语言..有谁知道如何解决这个问题...我只是想让它成为它是用RSS的XML文件编写的,无论如何都没有改变。
这是我完整的解析器类:
public class XMLParser {
// constructor
public XMLParser() {
}
/**
* Getting XML from URL making HTTP request
* @param url string
* */
public String getXmlFromUrl(String url) {
String xml = null;
try {
// defaultHttpClient
DefaultHttpClient httpClient = new DefaultHttpClient();
HttpPost httpPost = new HttpPost(url);
HttpResponse httpResponse = httpClient.execute(httpPost);
HttpEntity httpEntity = httpResponse.getEntity();
xml = EntityUtils.toString(httpEntity);
} catch (UnsupportedEncodingException e) {
e.printStackTrace();
} catch (ClientProtocolException e) {
e.printStackTrace();
} catch (IOException e) {
e.printStackTrace();
}
// return XML
return xml;
}
/**
* Getting XML DOM element
* @param XML string
* */
public Document getDomElement(String xml){
Document doc = null;
DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
dbf.setCoalescing(true);
dbf.setNamespaceAware(true);
try {
DocumentBuilder db = dbf.newDocumentBuilder();
InputSource is = new InputSource();
is.setCharacterStream(new StringReader(xml));
doc = db.parse(is);
} catch (ParserConfigurationException e) {
Log.e("Error: ", e.getMessage());
return null;
} catch (SAXException e) {
Log.e("Error: ", e.getMessage());
return null;
} catch (IOException e) {
Log.e("Error: ", e.getMessage());
return null;
}
return doc;
}
/** Getting node value
* @param elem element
*/
public final String getElementValue( Node elem ) {
Node child;
if( elem != null){
if (elem.hasChildNodes()){
for( child = elem.getFirstChild(); child != null; child = child.getNextSibling() ){
//if( child.getNodeType() == Node.TEXT_NODE ){
if(child.getNodeType() == Node.TEXT_NODE || child.getNodeType() == Node.CDATA_SECTION_NODE){
return child.getNodeValue();
}
}
}
}
return "";
//return elem.getTextContent();
}
/**
* Getting node value
* @param Element node
* @param key string
* */
public String getValue(Element item, String str) {
NodeList n = item.getElementsByTagName(str);
return this.getElementValue(n.item(0));
}
public String getValue2(Element item, String str){
//NodeList n = item.getElementsByTagNameNS("http://purl.org/rss/1.0/modules/content/", str);
return item.getTextContent();
}
}
答案 0 :(得分:0)
您应确保将文本解析为utf-8。但是当你这样做的时候,你仍然会有一堆奇怪的字符,但这很正常......许多语言都有奇怪的字符。丹麦(我的根)有æøå。
修改强>:
试试这个:
EntityUtils.toString(httpEntity,"UTF-8");