Iterating over an unknown XML structure with PHP (DOM)

Asked: 2009-06-16 22:56:53

Tags: php xml arrays parsing loops

I want to write a function that parses a (theoretically) unknown XML data structure into an equivalent PHP array.

Here is my sample XML:

<?xml version="1.0" encoding="UTF-8"?>
<content>

<title>Sample Text</title>

<introduction>
    <paragraph>This is some rudimentary text</paragraph>
</introduction>
<description>
    <paragraph>Here is some more text</paragraph>
    <paragraph>Even MORE text</paragraph>
    <sub_section>
        <sub_para>This is a smaller, sub paragraph</sub_para>
        <sub_para>This is another smaller, sub paragraph</sub_para>
    </sub_section>
</description>
</content>

I adapted this DOM iteration function from devarticles:

$data = 'path/to/xmldoc.xml';
$xmlDoc = new DOMDocument(); #create a DOM element
$xmlDoc->load( $data ); #load data into the element
$xmlRoot = $xmlDoc->firstChild; #establish root

function xml2array($node)
{
    if ($node->hasChildNodes())
    {
        $subNodes = $node->childNodes;
        foreach ($subNodes as $subNode)
        {
            #filter node types
            if (($subNode->nodeType != 3) || (($subNode->nodeType == 3)))
            {
                $arraydata[$subNode->nodeName]=$subNode->nodeValue;
            }
            xml2array($subNode);
        }
    }
    return $arraydata;
}
//The getNodesInfo function call

 $xmlarray = xml2array($xmlRoot);


// print the output - with a little bit of formatting for ease of use...
foreach($xmlarray as $xkey)
     {
     echo"$xkey<br/><br/>";
     }

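Now, because of the way I'm passing elements into the array, I'm overwriting any elements that share a node name (ideally I'd like the keys to have the same names as their originating nodes). My recursion isn't great... However, even if I empty the brackets, the second tier of nodes still comes in as values on the first tier (see the text of the description node).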

Does anyone have any ideas on how I can structure this better?

3 answers:

Answer 0 (score: 2)

You're probably better off just grabbing some code off the web:

http://www.bin-co.com/php/scripts/xml2array/

    /**
     * xml2array() will convert the given XML text to an array in the XML structure.
     * Link: http://www.bin-co.com/php/scripts/xml2array/
     * Arguments : $contents - The XML text
     *                $get_attributes - 1 or 0. If this is 1 the function will get the attributes as well as the tag values - this results in a different array structure in the return value.
     *                $priority - Can be 'tag' or 'attribute'. This changes the structure of the resulting array. For 'tag', the tags are given more importance.
     * Return: The parsed XML in an array form. Use print_r() to see the resulting array structure.
     * Examples: $array =  xml2array(file_get_contents('feed.xml'));
     *              $array =  xml2array(file_get_contents('feed.xml'), 1, 'attribute');
     */
    function xml2array($contents, $get_attributes=1, $priority = 'tag') { 

Answer 1 (score: 1)

You might be interested in SimpleXML or xml_parse_into_struct.
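As a rough illustration of the SimpleXML route, a commonly used shortcut is to round-trip the parsed document through JSON. This is only a sketch under some assumptions: the XML is a trimmed copy of the sample from the question, the variable names are made up for the example, and the shortcut is lossy (attributes end up under an "@attributes" key and mixed text/element content is not preserved).

```php
<?php
// Trimmed copy of the sample XML from the question (assumed input).
$xml = <<<'XML'
<content>
<title>Sample Text</title>
<description>
    <paragraph>Here is some more text</paragraph>
    <paragraph>Even MORE text</paragraph>
    <sub_section>
        <sub_para>This is a smaller, sub paragraph</sub_para>
    </sub_section>
</description>
</content>
XML;

// Parse the string into a SimpleXMLElement tree.
$element = simplexml_load_string($xml);

// Encode the tree as JSON, then decode it into nested associative arrays.
// Repeated sibling names (the two <paragraph> elements) become a list.
$array = json_decode(json_encode($element), true);

print_r($array);
```

Note that a name that occurs only once maps to a plain value, while a repeated name maps to a list, so code consuming the array has to handle both shapes.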

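$arraydata is neither passed to the subsequent calls of xml2array() nor is the return value used, so yes, "My recursion isn't great..." is true ;-)
To append a new element to an existing array you can use empty square brackets: $arr[] = 123; $arr[$x][] = 123;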
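Putting those two points together, a minimal sketch might look like the following. The function name domNodeToArray and the shape of the returned array (every node name maps to a list of values) are assumptions made for this example, not something taken from the question; attributes and mixed content are ignored.

```php
<?php
// Hypothetical helper: converts the element children of $node into a nested array.
function domNodeToArray(DOMNode $node)
{
    $result = [];
    foreach ($node->childNodes as $child) {
        // Skip everything that is not an element (text, whitespace, comments).
        if ($child->nodeType !== XML_ELEMENT_NODE) {
            continue;
        }
        // Check whether this element has element children of its own.
        $hasElementChildren = false;
        foreach ($child->childNodes as $grandChild) {
            if ($grandChild->nodeType === XML_ELEMENT_NODE) {
                $hasElementChildren = true;
                break;
            }
        }
        // Recurse and actually use the return value; leaves keep their trimmed text.
        $value = $hasElementChildren
            ? domNodeToArray($child)
            : trim($child->textContent);
        // Empty brackets append, so siblings sharing a name are all kept.
        $result[$child->nodeName][] = $value;
    }
    return $result;
}

// Trimmed copy of the sample XML from the question (assumed input).
$xml = <<<'XML'
<content>
<title>Sample Text</title>
<description>
    <paragraph>Here is some more text</paragraph>
    <paragraph>Even MORE text</paragraph>
    <sub_section>
        <sub_para>This is a smaller, sub paragraph</sub_para>
    </sub_section>
</description>
</content>
XML;

$doc = new DOMDocument();
$doc->loadXML($xml);
// documentElement is the root element, regardless of any nodes preceding it.
$array = domNodeToArray($doc->documentElement);
print_r($array);
```

Because every name maps to a list, single and repeated elements are read the same way, for example $array['description'][0]['paragraph'][1].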

Answer 2 (score: 0)

You might also want to take a look at XML Unserializer:

http://pear.php.net/package/XML_Serializer/redirected