PHP / XML - 如何读取多个子

时间:2013-05-23 12:01:21

标签: xml simplexml php

我需要创建一个包含此XML文件中所有主题值的数组。 ISIN列表似乎工作正常(第一个属性值),但主题值不起作用。

我想最终得到一个类似这样的数组:

$Companys = array ( [0]  => array ( "isin" => "DK0010247014","company" => "AAB"),
                    [1]  => array ( "isin" => "DK0015250344","company" => "ALM BRAND"),
                    [2]  => array ( "isin" => "DK0015998017","company" => "BAVARIAN NORDI"),
                    [3]  => array ( "isin" => "DK0010259027","company" => "DFDS"),
                    [4]  => array ( "isin" => "DK0010234467","company" => "FLSMIDTH & CO"),
                );

这是我尝试解析的其中一个文件的示例:

<doc>
    <id>123456</id>
    <version>4.0</version>
    <consnr>7861</consnr>
    <doctype>10</doctype>
    <dest>99</dest>
    <created>2013-05-15 14:18:16</created>
    <source>Direkt-DK</source>
    <language>DA</language>
    <texttype>This is a type</texttype>
    <premium>False</premium>
    <header>This is a header</header>
    <text>
        <para format="Text">This is a paragraph</para>
        <para format="Text">This is a paragraph</para>
        <para format="Text">This is a paragraph</para>
        <para format="Text">This is a paragraph</para>
        <para format="Text"/>
        <para format="Text">This is a paragraph</para>
        <para format="Byline"/>
        <para format="Byline">contents og the by line</para>
        <para format="Byline"/>
        <para format="Byline"/>
    </text>
    <subjects>
        <subject value="AAB" weight="Main">
            <property value="DK0010247014" type2="isin" type1="identificator"/>
            <property value="CSE:AAB" type2="ticker" type1="identificator"/>
            <property type1="sector" type2="GICS" type3="1" value="25"/>
            <property type1="sector" type2="GICS" type3="2" value="2530"/>
            <property type1="sector" type2="GICS" type3="3" value="253010"/>
            <property type1="sector" type2="GICS" type3="4" value="25301030"/>
        </subject>
        <subject value="ALM BRAND" weight="Main">
            <property value="DK0015250344" type2="isin" type1="identificator"/>
            <property value="CSE:ALMB" type2="ticker" type1="identificator"/>
            <property type1="sector" type2="GICS" type3="1" value="40"/>
            <property type1="sector" type2="GICS" type3="2" value="4030"/>
            <property type1="sector" type2="GICS" type3="3" value="403010"/>
            <property type1="sector" type2="GICS" type3="4" value="40301040"/>
        </subject>
        <subject value="BAVARIAN NORDI" weight="Main">
            <property value="DK0015998017" type2="isin" type1="identificator"/>
            <property value="CSE:BAVA" type2="ticker" type1="identificator"/>
            <property type1="sector" type2="GICS" type3="1" value="35"/>
            <property type1="sector" type2="GICS" type3="2" value="3520"/>
            <property type1="sector" type2="GICS" type3="3" value="352010"/>
            <property type1="sector" type2="GICS" type3="4" value="35201010"/>
        </subject>
        <subject value="DFDS" weight="Main">
            <property value="DK0010259027" type2="isin" type1="identificator"/>
            <property value="CSE:DFDS" type2="ticker" type1="identificator"/>
            <property type1="sector" type2="GICS" type3="1" value="20"/>
            <property type1="sector" type2="GICS" type3="2" value="2030"/>
            <property type1="sector" type2="GICS" type3="3" value="203030"/>
            <property type1="sector" type2="GICS" type3="4" value="20303010"/>
        </subject>
        <subject value="FLSMIDTH & CO" weight="Main">
            <property value="DK0010234467" type2="isin" type1="identificator"/>
            <property value="CSE:FLS" type2="ticker" type1="identificator"/>
            <property type1="sector" type2="GICS" type3="1" value="20"/>
            <property type1="sector" type2="GICS" type3="2" value="2010"/>
            <property type1="sector" type2="GICS" type3="3" value="201030"/>
            <property type1="sector" type2="GICS" type3="4" value="20103010"/>
        </subject>
    </subjects>
</doc>

脚本:

<?
    foreach($xmlObj->subjects->subject as $b ){
        $isin = $b->property;
        $company = $b->attributes();
        #$company = $b->attributes()->value;
        If($isin && $isinlist == 'null') $isinlist = $isin['value'];
        ElseIf ($isin && $isinlist) $isinlist .= ','.$isin['value'];
        If($company && $companylist == 'null') $companylist = $company['value'];
        ElseIf ($company && $companylist) $companylist .= ','.$company['value'];
        var_dump($company->value[0]);
    }
?>

1 个答案:

答案 0 :(得分:0)

您遇到的主要问题是根据属性值查找子元素。由于有多个子元素具有相同的元素名称,因此您无法区分名称。

在具体示例中,基于属性 type2 =&#34; isin&#34; 属性子项。

这可能是通过使用Xpath(这个网站已经有很多Q&amp; A材料,例如SimpleXML: Selecting Elements Which Have A Certain Attribute Value)或通过扩展SimpleXMLElement来实现它的功能:

class MyElement extends SimpleXMLElement
{
    public function getChildByAttributeValue($name, $value) {
        foreach($this as $child)
        {
            if ($value === (string) $child[$name]) {
                return $child;
            }
        }
    }
}

然后,您可以使用MyElement的{​​{1}} 代替

SimpleXMLElement

并将您的值映射到数组:

$xml = simplexml_load_string($buffer, 'MyElement');
                                      ###########

鉴于$map = function(MyElement $subject) { return [ (string) $subject['value'], (string) $subject->getChildByAttributeValue('type2', 'isin')['value'], ]; }; print_r(array_map($map, $xml->xpath('//subject'))); 是您提供的XML(并且删除了编码错误),这会创建以下输出:

$buffer

完整的代码示例(Online Demo):

Array
(
    [0] => Array
        (
            [0] => AAB
            [1] => DK0010247014
        )

    [1] => Array
        (
            [0] => ALM BRAND
            [1] => DK0015250344
        )

    [2] => Array
        (
            [0] => BAVARIAN NORDI
            [1] => DK0015998017
        )

    [3] => Array
        (
            [0] => DFDS
            [1] => DK0010259027
        )

    [4] => Array
        (
            [0] => FLSMIDTH & CO
            [1] => DK0010234467
        )

)