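Parsing child nodes with XMLReader

Time: 2013-02-17 03:28:10

Tags: php xml xmlreader

I am parsing a large XML document and have run into trouble parsing child nodes. Below is a sample of what I am trying to parse.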

<link rel="http://xxxxx/people.employees" title="employees">
  <people>
    <link href="/154" rel="http://catalog/person" title="Guy Nom" />
    <link href="/385" rel="http://catalog/person" title="Carrie Jin" />
    <link href="/162" rel="http://catalog/person" title="Joe Zee" />
    <link href="/2125" rel="http://catalog/person" title="Mark Polin" />
    <link href="/9293" rel="http://catalog/person" title="Stephen Castor" />
    <link href="/21822" rel="http://catalog/person" title="Callum Tinge" />
    <link href="/2022" rel="http://catalog/person" title="Brian Lennon" />
    <link href="/2040" rel="http://catalog/person" title="Jorja Fox" />
    <link href="/2046" rel="http://catalog/person" title="Harry Harris" />
    <link href="/2399" rel="http://catalog/person" title="Sam Muellerleile" />
  </people>
</link>
<link rel="http://xxxxx/people/others" title="others">
  <people>
    <link href="/7143" rel="http://catalog/person" title="James Smith" />
  </people>
</link>

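I need to distinguish between the "employees" and the "others" and store them in separate fields. I would like to do something like the following: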

if($xmlReader->localName == 'link') {
    if ($xmlReader->getAttribute('title') == "employees"){
      //GO TO NEXT LINK TAG AND GET NAME
      $myObject->employees[$myObject->employees_count]['name'] = $xmlReader->getAttribute('title');
      $myObject->employees_count++;

    } else if ($xmlReader->getAttribute('title') == "others"){
      //GO TO NEXT LINK TAG AND GET NAME
      $myObject->others[$myObject->others_count]['name'] = $xmlReader->getAttribute('title');
      $myObject->others_count++;

    }
  }

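Obviously, the commented parts above are where I am stuck. I don't know how to read those child elements, and in my opinion the PHP documentation on this is not very good. I would appreciate any help.

3 answers:

Answer 0 (score: 2)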

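With XMLReader you can make use of the $depth property. In your case the outer <link> elements have a depth of 1 (one), assuming they sit directly under a single root element. As you continue to read, you can check whether the current element is still a child: once you see an END_ELEMENT at the same $depth, you know that the children have all been consumed.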
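The depth check described above can be sketched with a plain XMLReader loop. This is only an illustration under stated assumptions: the sample is shortened and wrapped in a hypothetical <result> root element, and the $groups, $current and $groupDepth variables are names introduced here, not part of the question's code.

```php
<?php
// Shortened sample from the question, wrapped in an assumed <result> root.
$xml = <<<XML
<result>
  <link rel="http://xxxxx/people.employees" title="employees">
    <people>
      <link href="/154" rel="http://catalog/person" title="Guy Nom" />
      <link href="/385" rel="http://catalog/person" title="Carrie Jin" />
    </people>
  </link>
  <link rel="http://xxxxx/people/others" title="others">
    <people>
      <link href="/7143" rel="http://catalog/person" title="James Smith" />
    </people>
  </link>
</result>
XML;

$reader = new XMLReader();
$reader->XML($xml);

$groups = ['employees' => [], 'others' => []];
$current = null;     // title of the group we are currently inside, or null
$groupDepth = null;  // depth of that group's <link> element

while ($reader->read()) {
    if ($reader->nodeType === XMLReader::ELEMENT && $reader->localName === 'link') {
        $title = (string) $reader->getAttribute('title');
        if ($current === null) {
            // Outside any group: enter one if the title is a known group
            // and the element can actually contain children.
            if (isset($groups[$title]) && !$reader->isEmptyElement) {
                $current = $title;
                $groupDepth = $reader->depth;
            }
        } elseif ($reader->depth > $groupDepth) {
            // A deeper <link> is a person inside the current group.
            $groups[$current][] = $title;
        }
    } elseif ($reader->nodeType === XMLReader::END_ELEMENT
        && $current !== null
        && $reader->depth === $groupDepth
    ) {
        // Closing tag at the group's own depth: all children are consumed.
        $current = null;
    }
}
$reader->close();

print_r($groups);
```

Because only one node is held at a time, this approach should stay usable for large documents, which is the situation described in the question.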

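In an answer yesterday I showed how to encapsulate that logic by extending XMLReader.

It lets you pass the depth of the parent element to a new method named readToNextChildElement($depth), which then traverses only the child elements.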

Usage example:

$depth = $reader->depth; # parent elements depth
while ($reader->readToNextChildElement($depth)) {
    # only children
}

The implementation is:

class MyXMLReader extends XMLReader
{
    ...

    public function readToNextChildElement($depth)
    {
        // if the current element is the parent and
        // empty there are no children to go into
        if ($this->depth == $depth && $this->isEmptyElement) {
            return false;
        }

        while ($result = $this->read()) {
            if ($this->depth <= $depth) return false;
            if ($this->nodeType === self::ELEMENT) break;
        }

        return $result;
    }

    ...

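You can find the rest of the code in the linked answer. Depending on your needs this may be helpful, if you want to base your parsing on XMLReader. Otherwise, if you can load the whole document into memory, it is easier to query the elements with XPath.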

$employees_names = array_map(
    'strval', 
    $sxml->xpath('//link[@title="employees"]//link/@title')
);

That is with SimpleXML, where $sxml is the loaded SimpleXMLElement.
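As a self-contained sketch of the XPath approach, the snippet below loads a shortened copy of the sample (wrapped in an assumed <result> root) and runs the same query for both groups. The titlesInGroup() helper is hypothetical and introduced only for illustration; it concatenates the group name into the XPath expression, so it should only be called with trusted values.

```php
<?php
// Shortened sample from the question, wrapped in an assumed <result> root.
$xml = <<<XML
<result>
  <link rel="http://xxxxx/people.employees" title="employees">
    <people>
      <link href="/154" rel="http://catalog/person" title="Guy Nom" />
      <link href="/385" rel="http://catalog/person" title="Carrie Jin" />
    </people>
  </link>
  <link rel="http://xxxxx/people/others" title="others">
    <people>
      <link href="/7143" rel="http://catalog/person" title="James Smith" />
    </people>
  </link>
</result>
XML;

$sxml = simplexml_load_string($xml);

/**
 * Hypothetical helper: returns the title attributes of all <link>
 * elements nested inside the <link> whose title equals $group.
 */
function titlesInGroup(SimpleXMLElement $sxml, string $group): array
{
    $nodes = $sxml->xpath('//link[@title="' . $group . '"]//link/@title');

    // xpath() may not return an array on failure, so fall back to an empty one.
    return array_map('strval', $nodes ?: []);
}

$employees = titlesInGroup($sxml, 'employees');
$others = titlesInGroup($sxml, 'others');

print_r($employees);
print_r($others);
```

This loads the whole document into memory, so it fits the "otherwise" case mentioned above rather than the large-file case.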

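Answer 1 (score: 2)

Use XMLReader::readOuterXml(), which returns the current element including its own tags (the related XMLReader::readInnerXml() returns only the element's contents):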

<?php
$reader = new XMLReader();
$reader->open("filename.xml");

while ($reader->read()) {

    if($reader->name=='Foo' && $reader->nodeType == XMLReader::ELEMENT) {

        $reader->moveToElement();
        $Foo = new SimpleXMLElement($reader->readOuterXml());

        //$Foo->bar

    }
}
$reader->close();
?>
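Applied to the XML from the question, the same idea might look like the sketch below. It is an illustration under assumptions, not the answerer's code: the sample is shortened and wrapped in an assumed <result> root, the group elements are recognised by their title attribute, and the $groups array is a name introduced here.

```php
<?php
// Shortened sample from the question, wrapped in an assumed <result> root.
$xml = <<<XML
<result>
  <link rel="http://xxxxx/people.employees" title="employees">
    <people>
      <link href="/154" rel="http://catalog/person" title="Guy Nom" />
      <link href="/385" rel="http://catalog/person" title="Carrie Jin" />
    </people>
  </link>
  <link rel="http://xxxxx/people/others" title="others">
    <people>
      <link href="/7143" rel="http://catalog/person" title="James Smith" />
    </people>
  </link>
</result>
XML;

$reader = new XMLReader();
$reader->XML($xml);

$groups = ['employees' => [], 'others' => []];

while ($reader->read()) {
    if ($reader->nodeType !== XMLReader::ELEMENT || $reader->localName !== 'link') {
        continue;
    }

    $title = (string) $reader->getAttribute('title');
    if (!isset($groups[$title])) {
        // Not a group element. The person <link> children are visited here
        // too, and are skipped because their titles are not group names.
        continue;
    }

    // Hand only this group's subtree to SimpleXML.
    $group = new SimpleXMLElement($reader->readOuterXml());
    foreach ($group->people->link as $person) {
        $groups[$title][] = (string) $person['title'];
    }
}
$reader->close();

print_r($groups);
```

Only one group's subtree is expanded into a SimpleXMLElement at a time, so memory use depends on the size of the largest group rather than on the whole file. A person whose title happened to be "employees" or "others" would be mistaken for a group here; checking $reader->depth as well would rule that out.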

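Answer 2 (score: 1)

Personally, I would use SimpleXML, since XMLReader is not well documented at all. Depending on your needs, SimpleXML should work just fine, provided you don't already depend on XMLReader for parsing the other parts of the document. That being said, here is the code I used, along with the input.

test.xml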

<?xml version="1.0" encoding="UTF-8" ?>
<result>
<link rel="http://xxxxx/people.employees" title="employees">
    <people>
        <link href="/154" rel="http://catalog/person" title="Guy Nom" />
        <link href="/385" rel="http://catalog/person" title="Carrie Jin" />
        <link href="/162" rel="http://catalog/person" title="Joe Zee" />
        <link href="/2125" rel="http://catalog/person" title="Mark Polin" />
        <link href="/9293" rel="http://catalog/person" title="Stephen Castor" />
        <link href="/21822" rel="http://catalog/person" title="Callum Tinge" />
        <link href="/2022" rel="http://catalog/person" title="Brian Lennon" />
        <link href="/2040" rel="http://catalog/person" title="Jorja Fox" />
        <link href="/2046" rel="http://catalog/person" title="Harry Harris" />
        <link href="/2399" rel="http://catalog/person" title="Sam Muellerleile" />
    </people>
</link>
<link rel="http://xxxxx/people/others" title="others">
    <people>
        <link href="/7143" rel="http://catalog/person" title="James Smith" />
    </people>
</link>
</result>

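And then the PHP that parses that sample. (Note that I did not include your variables, but you should be able to get what you need from this. Also, the part at the end is just validation, i.e. it shows what has been populated.)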

<?php

$xml = simplexml_load_file('test.xml','SimpleXMLElement', LIBXML_NOCDATA);
//Place holder variables as I don't have access to the object.
$emp=array();$emp_count=0;$other=array();$other_count=0;

foreach($xml->link as $links) {
    $at = $links->attributes();
    if($at['title'] == 'employees') {
        foreach($links->people->link as $person) {
            $emp_count++;
            $employee = $person->attributes();
            $emp[] = (string)$employee['title'];
        }
    } elseif($at['title'] == 'others') {
        foreach($links->people->link as $person) {
            $other_count++;
            $others = $person->attributes();
            $other[] = (string)$others['title'];
        }
    }
}
echo "<pre>";
echo "Employees: $emp_count\n";
print_r($emp);

echo "Others: $other_count\n";
print_r($other);

echo "</pre>";
?>

Here is the output (so you don't have to run it yourself ^^):

Employees: 10
Array
(
    [0] => Guy Nom
    [1] => Carrie Jin
    [2] => Joe Zee
    [3] => Mark Polin
    [4] => Stephen Castor
    [5] => Callum Tinge
    [6] => Brian Lennon
    [7] => Jorja Fox
    [8] => Harry Harris
    [9] => Sam Muellerleile
)
Others: 1
Array
(
    [0] => James Smith
)

I hope that helps!