XML解析错误“格式良好”

时间:2016-05-27 08:52:48

标签: java xml parsing

我在数据库中有XML数据(不是文件) 我需要解析它以提供可能性写入测试以验证xml中的数据

xml(内容数据):

 <brid:AccountData xmlns:brid="http://billing.a1telekom.at/BridgeIT/BridgeITDefinition">
 <brid:AccountNumber>250000000032</brid:AccountNumber>
 <brid:CustomerNumber>100104653</brid:CustomerNumber>
 <brid:AccountType>NORM</brid:AccountType>
 <brid:BillCycle>M2</brid:BillCycle>
 <brid:LastInvoiceDate>0001-01-01T00:00:00.000</brid:LastInvoiceDate>
 <brid:BillThroughDate>0001-01-01T00:00:00.000</brid:BillThroughDate>
 <brid:StartDate>2016-02-26T15:27:13</brid:StartDate>
 <brid:EndDate>9999-12-31T23:59:59.000</brid:EndDate>
 <brid:AccountStatus>Active</brid:AccountStatus>
 <brid:TaxCode>U2</brid:TaxCode>
 <brid:CostCentre></brid:CostCentre>
</brid:AccountData>
<brid:PaymentData xmlns:brid="http://billing.a1telekom.at/BridgeIT/BridgeITDefinition">
 <brid:PaymentMethod>Manual</brid:PaymentMethod>
</brid:PaymentData>
<brid:MediaData xmlns:brid="http://billing.a1telekom.at/BridgeIT/BridgeITDefinition">
 <brid:AccountNumber>250000000032</brid:AccountNumber>
 <brid:MediaType>PAPIER</brid:MediaType>
 <brid:StartDate>2016-02-26T15:27:13</brid:StartDate>
 <brid:EndDate>9999-12-31T23:59:59.000</brid:EndDate>
 <brid:InvoiceName>ApuiafgjkLrjgdna Fouydf</brid:InvoiceName>
 <brid:Language>DE</brid:Language>
 <brid:LocationID>118298</brid:LocationID>
 <brid:FirstName>Fouydf</brid:FirstName>
 <brid:LastName>ApuiafgjkLrjgdna</brid:LastName>
 <brid:TitleCode></brid:TitleCode>
 <brid:TitleText></brid:TitleText>
 <brid:Prefix></brid:Prefix>
 <brid:Suffix>Cxvb</brid:Suffix>
 <brid:Currency>EUR</brid:Currency>
 <brid:Format>2</brid:Format>
 <brid:Atomized>N</brid:Atomized>
</brid:MediaData>

我如何尝试解析它:

P

ackage simbaOnlineReplacement;

import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;
import ta.maxcare.help.DBSor;




public class XMLTest2 {
    public static void main(String[] args) {

        try {
            String data = new DBSor().getContentData("106897066");

            DocumentBuilderFactory dbFactory = DocumentBuilderFactory.newInstance();
            DocumentBuilder dBuilder = dbFactory.newDocumentBuilder();
            InputSource is = new InputSource(new StringReader(data));
            Document doc = dBuilder.parse(is);
            doc.getDocumentElement().normalize();
            System.out.println("Root element :" + doc.getDocumentElement().getNodeName());
            NodeList nList = doc.getElementsByTagName("MediaData");
            System.out.println("----------------------------");
            for (int temp = 0; temp < nList.getLength(); temp++) {
                Node nNode = nList.item(temp);
                System.out.println("\nCurrent Element :" + nNode.getNodeName());
                if (nNode.getNodeType() == Node.ELEMENT_NODE) {
                    Element eElement = (Element) nNode;

                    System.out.println("First Name : " + eElement.getElementsByTagName("FirstName").item(0).getTextContent());

                    System.out.println("LastName : " + eElement.getElementsByTagName("LastName").item(0).getTextContent());
                }
            }
        } catch (Exception e) {
            e.printStackTrace();
        }
    }
}

和e给出错误:

  

[致命错误]:14:2:根目录后面的文档中的标记   元素必须格式良好。 org.xml.sax.SAXParseException;   lineNumber:14; columnNumber:2;以下文档中的标记   根元素必须格式良好。在   com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(未知   来源)at   com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(未知   来源):simbaOnlineReplacement.XMLTest2.main(XMLTest2.java:23)

2 个答案:

答案 0 :(得分:1)

在格式良好的XML中,您必须拥有一个根元素。 brid:AccountDatabrid:PaymentDatabrid:MediaData处于同一级别,这是错误的。作为一个例子,你应该有一个根元素将它们全部包围起来。

答案 1 :(得分:1)

您的XML是格式良好的外部常规解析实体,但它不是格式良好的文档。因此,处理它的最简单方法是创建一个简单的包装器文件,将其作为外部实体引用:

<!DOCTYPE wrapper [
<!ENTITY e SYSTEM "data.xml">
]>
<wrapper>&e;</wrapper>

然后解析包装文件。