Question

I am trying to extract all "name" and "form13FFileNumber" values from the XPath "//otherManagers2Info/otherManager2/otherManager" in this document: https://www.sec.gov/Archives/edgar/data/1067983/000095012314002615/primary_doc.xml

Here is my code. Any idea what I am doing wrong?

$xml = file_get_contents($url);

$dom = new DOMDocument();

$dom->loadXML($xml);

$x = new DOMXpath($dom);

$other_managers = array();

$nodes = $x->query('//otherManagers2Info/otherManager2/otherManager');

if (!empty($nodes)) {
    $i = 0;

    foreach ($nodes as $n) {
        $i++;

        $other_managers[$i]['form13FFileNumber'] = $x->evaluate('form13FFileNumber', $n)->item(0)->nodeValue;
        $other_managers[$i]['name'] = $x->evaluate('name', $n)->item(0)->nodeValue;
    }
}

Answer 1

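As you posted in the comments, you can register the namespace for XPath under a prefix of your own. Namespace prefixes are just aliases. XPath has no default namespace, so you always have to register a prefix and use it in the expression.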
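To illustrate the point about prefixes being aliases, here is a minimal sketch. The XML is a made-up, trimmed-down stand-in for the filing (the file number and manager name are hypothetical), and the prefix `foo` is chosen arbitrarily; only the namespace URI has to match the document.

```php
<?php
// Hypothetical, trimmed-down sample; the real filing has many more elements.
$xml = <<<'XML'
<edgarSubmission xmlns="http://www.sec.gov/edgar/thirteenffiler">
  <otherManagers2Info>
    <otherManager2>
      <otherManager>
        <form13FFileNumber>028-00001</form13FFileNumber>
        <name>Example Manager A</name>
      </otherManager>
    </otherManager2>
  </otherManagers2Info>
</edgarSubmission>
XML;

$dom = new DOMDocument();
$dom->loadXML($xml);
$xpath = new DOMXPath($dom);

// "e13" and "foo" are arbitrary aliases for the same namespace URI.
$xpath->registerNamespace('e13', 'http://www.sec.gov/edgar/thirteenffiler');
$xpath->registerNamespace('foo', 'http://www.sec.gov/edgar/thirteenffiler');

// Unprefixed names only match elements in no namespace, so nothing is found.
$unprefixed = $xpath->query('//otherManager')->length;
$withE13 = $xpath->query('//e13:otherManager')->length;
$withFoo = $xpath->query('//foo:otherManager')->length;

var_dump($unprefixed, $withE13, $withFoo); // int(0), int(1), int(1)
```

The unprefixed query is what the code in the question does, which is why it finds no nodes.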

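Location path expressions always return a traversable node list, so you can iterate over the result with foreach. Both query() and evaluate() take a context node as the second argument, and the expression is evaluated relative to that context. Finally, evaluate() can return scalar values directly. This happens when you cast the node list to a scalar type (such as a string) inside the XPath expression, or use a function like count().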

$dom = new DOMDocument();
$dom->loadXml($xml);

$xpath = new DOMXpath($dom);
$xpath->registerNamespace('e13', 'http://www.sec.gov/edgar/thirteenffiler');
$xpath->registerNamespace('ecom', 'http://www.sec.gov/edgar/common');

$result = [];
$nodes = $xpath->evaluate('//e13:otherManagers2Info/e13:otherManager2/e13:otherManager');
foreach ($nodes as $node) {
  $result[] = [
    'form13FFileNumber' => $xpath->evaluate('string(e13:form13FFileNumber)', $node),
    'name' => $xpath->evaluate('string(e13:name)', $node),
  ];
}

var_dump($result);

Demo: https://eval.in/125200
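As a further sketch of the scalar behaviour of evaluate(), the snippet below uses a small made-up document (the manager names and the `doesNotExist` element are hypothetical) to show what count(), string(), and a plain location path each return.

```php
<?php
// Hypothetical sample data, not taken from the real filing.
$xml = <<<'XML'
<edgarSubmission xmlns="http://www.sec.gov/edgar/thirteenffiler">
  <otherManager><name>Example Manager A</name></otherManager>
  <otherManager><name>Example Manager B</name></otherManager>
</edgarSubmission>
XML;

$dom = new DOMDocument();
$dom->loadXML($xml);
$xpath = new DOMXPath($dom);
$xpath->registerNamespace('e13', 'http://www.sec.gov/edgar/thirteenffiler');

// count() makes evaluate() return a number (a PHP float), not a node list.
$total = $xpath->evaluate('count(//e13:otherManager)');

// A location path returns a DOMNodeList, even when nothing matches.
$missing = $xpath->evaluate('//e13:doesNotExist');

// With a context node, relative expressions start at that node.
$firstManager = $xpath->evaluate('//e13:otherManager')->item(0);
$name = $xpath->evaluate('string(e13:name)', $firstManager);

// string() on an empty match yields '' instead of failing.
$noSuchField = $xpath->evaluate('string(e13:form13FFileNumber)', $firstManager);

var_dump($total, $missing->length, $name, $noSuchField);
```

One practical consequence: with string(...) a missing child element simply produces an empty string, whereas ->item(0)->nodeValue reads a property on null when the element is absent.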
