我正在用PHP抓取一个网站。我已经删除了所有必需的数据,但我无法抓取span标签文本。
预期输出: Apr 20,2017
<span title="" data-toggle="tooltip" data-original-title="Posted On">
<i class="calendar rz-calendar"></i>Apr 20, 2017
</span>
$html = file_get_contents($url);
libxml_use_internal_errors( true);
$doc = new DOMDocument;
$doc->loadHTML($html);
$xpath = new DOMXpath( $doc);
$node = $xpath->query( '//span[@data-original-title="Posted ON"]');
答案 0 :(得分:2)
您的代码中的问题//span[@data-original-title="Posted ON"]
ON
不是大写字母On
<?php
ini_set('display_errors', 1);
$doc = new DOMDocument;
$doc->loadHTML('<span title="" data-toggle="tooltip" data-original-title="Posted On">
<i class="calendar rz-calendar"></i>Apr 20, 2017
</span>');
$xpath = new DOMXpath( $doc);
$nodeList = $xpath->query( '//span[@data-original-title="Posted On"]');
foreach($nodeList as $node)
{
echo trim($node->textContent);
}