XML validation with XMLReader in PHP

Date: 2015-06-10 14:47:45

Tags: php xml xsd xmlreader

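I get errors when validating a generated XML string. I load the XML string with XMLReader and assign an XSD file to validate it against.

The XML contains object IDs and URLs, which are validated against patterns of allowed characters. As far as I can tell, the ID and the URL are correct. So why does validation report errors?

I get error messages like these: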

Element 'objectID': [facet 'pattern'] The value 'ffc89' is not accepted by the pattern '^[a-z]{1,1}[a-z0-9.-]{3,14}$'.
Element 'objectID': 'ffc89' is not a valid value of the local atomic type.
Element 'originUrl': [facet 'pattern'] The value 'http://domain.com/images/89/f972c66982290125.jpg' is not accepted by the pattern '^(http|https){1}(://){1}[a-zA-Z0-9\-\./#?&_]+'.
Element 'originUrl': 'http://domain.com/images/89/f972c66982290125.jpg' is not a valid value of the local atomic type.

Here is the code snippet:

$reader = new XMLReader();

// we enable user error handling
libxml_use_internal_errors(true);

// load the xml structure to test against the xsd
$reader->xml($xml_str_tocheck);
$reader->setSchema($xsd_file_name);

// read xml structure
while( $reader->read() ) ;

// close xml
$reader->close();

// get found xml errors
$errors = libxml_get_errors();

// we disable user error handling
// (Disabling will also clear any existing libxml errors.)
libxml_use_internal_errors(false);

// check if xml is not valid
if( count($errors) )
{
    foreach ($errors as $error)
    {
        echo $error->message;
    }
}

This is the XML string being validated:

<?xml version="1.0" encoding="UTF-8"?>
<oimages startFetchDate="2015-06-10T12:48:20+00:00">
  <object>
    <objectID>ffc89</objectID>
    <images>
      <image>
        <originUrl>http://domain.com/images/89/f972c66982290125.jpg</originUrl>
      </image>
    </images>
  </object>
</oimages>

This is the XSD file:

<xs:schema attributeFormDefault="unqualified" elementFormDefault="qualified" xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="images">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="object" maxOccurs="unbounded" minOccurs="1">
          <xs:complexType>
            <xs:sequence>
              <xs:element name="objectID" minOccurs="1" maxOccurs="1">
                <xs:simpleType>
                  <xs:restriction base="xs:string">
                    <xs:minLength value="4"/>
                    <xs:maxLength value="15"/>
                    <xs:pattern value="^[a-z]{1,1}[a-z0-9.-]{3,14}$"/>
                  </xs:restriction>
                </xs:simpleType>
              </xs:element>
              <xs:element name="images" maxOccurs="1" minOccurs="1">
                <xs:complexType>
                  <xs:sequence>
                    <xs:element name="image" maxOccurs="unbounded" minOccurs="0">
                      <xs:complexType>
                        <xs:sequence>
                          <xs:element name="url" minOccurs="1" maxOccurs="1">
                            <xs:simpleType>
                              <xs:restriction base="xs:string">
                                <xs:minLength value="10"/>
                                <xs:pattern value="^(http|https){1}(://){1}[a-zA-Z0-9\-\./#?&amp;_]+" />
                              </xs:restriction>
                            </xs:simpleType>
                          </xs:element>
                          </xs:element>
                        </xs:sequence>
                      </xs:complexType>
                    </xs:element>
                  </xs:sequence>
                </xs:complexType>
              </xs:element>
            </xs:sequence>
          </xs:complexType>
        </xs:element>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>

1 Answer:

Answer 0 (score: 1)

Your XML is not valid against your XSD.

Make the following changes to the XSD:

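  1. Remove the extra closing tag for xs:element on line 31.
  2. Change the root element name from images to oimages.
  3. Add the startFetchDate attribute to oimages.
  4. Remove the leading ^ and trailing $ from ^[a-z]{1,1}[a-z0-9.-]{3,14}$, because regular expressions in XSD are already implicitly anchored to the start and end of the string, so ^ and $ are matched as literal characters.
  5. Remove the leading ^ from ^(http|https){1}(://){1}[a-zA-Z0-9\-\./#?&amp;_]+ for the same reason.

After making the above changes to the XSD, the XML validates successfully against it.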
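
For reference, here is a minimal sketch of what the XSD might look like with the changes listed above applied, checked with the same XMLReader approach as in the question. Several things in it are assumptions rather than part of the original post: the helper name validateXml is hypothetical, startFetchDate is typed as xs:dateTime, the URL element is named originUrl so that it matches the XML and the error messages, and the schema is written to a temporary file because XMLReader::setSchema() takes a file name.

```php
<?php
// Corrected schema (sketch). Assumptions: xs:dateTime for startFetchDate,
// and originUrl as the element name, matching the XML being validated.
$xsd = <<<'XSD'
<xs:schema attributeFormDefault="unqualified" elementFormDefault="qualified" xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="oimages">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="object" maxOccurs="unbounded" minOccurs="1">
          <xs:complexType>
            <xs:sequence>
              <xs:element name="objectID" minOccurs="1" maxOccurs="1">
                <xs:simpleType>
                  <xs:restriction base="xs:string">
                    <xs:minLength value="4"/>
                    <xs:maxLength value="15"/>
                    <xs:pattern value="[a-z]{1,1}[a-z0-9.-]{3,14}"/>
                  </xs:restriction>
                </xs:simpleType>
              </xs:element>
              <xs:element name="images" maxOccurs="1" minOccurs="1">
                <xs:complexType>
                  <xs:sequence>
                    <xs:element name="image" maxOccurs="unbounded" minOccurs="0">
                      <xs:complexType>
                        <xs:sequence>
                          <xs:element name="originUrl" minOccurs="1" maxOccurs="1">
                            <xs:simpleType>
                              <xs:restriction base="xs:string">
                                <xs:minLength value="10"/>
                                <xs:pattern value="(http|https){1}(://){1}[a-zA-Z0-9\-\./#?&amp;_]+"/>
                              </xs:restriction>
                            </xs:simpleType>
                          </xs:element>
                        </xs:sequence>
                      </xs:complexType>
                    </xs:element>
                  </xs:sequence>
                </xs:complexType>
              </xs:element>
            </xs:sequence>
          </xs:complexType>
        </xs:element>
      </xs:sequence>
      <xs:attribute name="startFetchDate" type="xs:dateTime"/>
    </xs:complexType>
  </xs:element>
</xs:schema>
XSD;

$xml = <<<'XML'
<?xml version="1.0" encoding="UTF-8"?>
<oimages startFetchDate="2015-06-10T12:48:20+00:00">
  <object>
    <objectID>ffc89</objectID>
    <images>
      <image>
        <originUrl>http://domain.com/images/89/f972c66982290125.jpg</originUrl>
      </image>
    </images>
  </object>
</oimages>
XML;

// Hypothetical helper: returns the libxml error messages collected while
// reading $xml against the schema in $xsd (an empty array means no errors).
function validateXml(string $xml, string $xsd): array
{
    // setSchema() expects a file name, so store the schema in a temp file.
    $xsdFile = tempnam(sys_get_temp_dir(), 'xsd');
    file_put_contents($xsdFile, $xsd);

    $previous = libxml_use_internal_errors(true);
    libxml_clear_errors();

    $reader = new XMLReader();
    $reader->xml($xml);
    $reader->setSchema($xsdFile);
    while ($reader->read()) {
        // Reading through the whole document is what triggers validation.
    }
    $reader->close();

    $messages = [];
    foreach (libxml_get_errors() as $error) {
        $messages[] = trim($error->message);
    }

    libxml_clear_errors();
    libxml_use_internal_errors($previous);
    unlink($xsdFile);

    return $messages;
}

$errors = validateXml($xml, $xsd);
echo $errors === [] ? "valid\n" : implode("\n", $errors) . "\n";
```

If the ^ and $ are put back into the two pattern values, the same script should report the pattern facet errors from the question again, which is a quick way to confirm that the anchors were the cause.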