Question

我目前正在开发一个创建TCP套接字的项目，并监听服务器是否有传入的xml。 xml有时相当大，大约1-3 MB。 xml不断来自套接字，我需要解析它。我尝试了很多解析器，如DomParser，XMLPullParser和SaxParser。萨克斯似乎是最快的，所以我继续这样做。但是现在我有时会得到OutOfMemory。

我在这篇文章中读到，我们应该以块的形式向解析器提供数据。

How to parse huge xml data from webservice in Android application?

有人可以告诉我这是怎么做的。我目前的代码就像

InputSource xmlInputSource  =   new InputSource(new StringReader(response));
SAXParserFactory spf        =   SAXParserFactory.newInstance();
SAXParser sp                =   null;
XMLReader xr                =   null;
try{
    sp                      =   spf.newSAXParser();
    xr                      =   sp.getXMLReader();
    ParseHandler xmlHandler =   new ParseHandler(context.getSiteListArray().indexOf(website), context);
    xr.setContentHandler(xmlHandler);
    xr.parse(xmlInputSource);
    postSuccessfullParsingNotification();
}catch(SAXException e){
    e.printStackTrace();
}catch(ParserConfigurationException e){
    e.printStackTrace();
}catch (IOException e){
    e.printStackTrace();
    e.toString();
}

其中response是我从套接字收到的字符串。

应该考虑其他解析器，如VTD-XML？或者有没有办法让萨克斯有效地工作？

顺便说一句：每当一个新字符串到达要解析的套接字时，我打开一个新线程来解析字符串。

This is my handler code    

public class ParseHandler extends DefaultHandler {
    private Website     mWebsite;
    private Visitor     mVisitor;
    private VisitorInfo mVisitorInfo;
    private boolean     isVisit;
    private boolean     isVisitor;
    private AppContext  appContext;

    public ParseHandler(int index,AppContext context){
        appContext          =   context;
        mWebsite            =   appContext.getSiteListArray().get(index);
    }

    @Override
    public void startDocument() throws SAXException {
        super.startDocument();        
    }

    @Override
    public void startElement(String namespaceURI, String localName,String qName, Attributes atts) 
            throws SAXException {
        if(localName.equals("visit")) {
            isVisit = true;            
        } else if(localName.equals("visitor") && isVisit) {
            isVisitor  = true; 
            mVisitor = new Visitor();
            mVisitor.mDisplayName = "Visitor - #"+atts.getValue("id");
            mVisitor.mVisitorId   = atts.getValue("id");
            mVisitor.mStatus      = atts.getValue("idle");
        } else if(localName.equals("info") && isVisitor){
            mVisitorInfo = mVisitor.new VisitorInfo();
            mVisitorInfo.mBrowser     = atts.getValue("browser");
            mVisitorInfo.mBrowserName = atts.getValue("browser").replace("+", " ");
            mVisitorInfo.mCity        = atts.getValue("city").replace("+", " ");
            mVisitorInfo.mCountry     = atts.getValue("country");
            mVisitorInfo.mCountryName = atts.getValue("country");
            mVisitorInfo.mDomain      = atts.getValue("domain");
            mVisitorInfo.mIp          = atts.getValue("ip");
            mVisitorInfo.mLanguage    = atts.getValue("language");
            mVisitorInfo.mLatitude    = atts.getValue("lat");
            mVisitorInfo.mLongitude   = atts.getValue("long");
            mVisitorInfo.mOrg         = atts.getValue("org").replace("+", " ");
            mVisitorInfo.mOs          = atts.getValue("os");
            mVisitorInfo.mOsName      = atts.getValue("os").replace("+", " ");
            mVisitorInfo.mRegion      = atts.getValue("region").replace("+", " ");
            mVisitorInfo.mScreen      = atts.getValue("screen");
        }
    }   

    @Override
    public void characters(char ch[], int start, int length) {
    }

    @Override
    public void endElement(String namespaceURI, String localName, String qName) throws SAXException {
        if(localName.equals("visit")) {
            isVisit  = false;
        } else if(localName.equals("visitor")) {
            isVisitor = false;
            if(mVisitor == null){
                Log.e("mVisitor","mVisitor");
            } else if(mVisitor.mVisitorId == null){
                Log.e("mVisitor.mVisitorId","mVisitor.mVisitorId");   
            }
            mWebsite.mVisitors.put(mVisitor.mVisitorId, mVisitor);
        } else if(localName.equals("info")  && isVisitor) {
            mVisitor.mVisitorInfo = mVisitorInfo;
        }
    }

    @Override
    public void endDocument() throws SAXException {

    }
}

**

编辑：思考后......

**

经过进一步调查后，我发现我的解析没有导致异常。每次我从套接字收到一个流时，我都会将它存储在一个字符串中，并且我会一直追加它，直到我们在流中得到“\ n”。 “\ n”用于表示xml块的结尾。 字符串导致内存异常。我尝试了 StringBuilder ，但这也导致了同样的问题。我不知道为什么会这样。

现在我尝试直接发送输入流进行解析，但最后“\ n”导致解析异常。有什么我们可以设置，以便解析器将忽略“\ n”？

Answer 1

似乎你将整个xml文件传递给解析器，所以每当文件太大时，你都会得到outOfMemory异常。

您应该尝试以块的形式读取套接字的输出，并将其提供给解析器。所以你会在循环中执行xr.parse（）。

Answer 2

另一篇文章是关于我的问题制作的，而那里的答案是我问题的解决方案。

以下是解决此问题的人的解决方案。

Reading big chunk of xml data from socket and parse on the fly

Android SaxParser和OutOfMemory异常

编辑：思考后......

2 个答案: