Question

我试图用Xpath解析一个页面，但我没有设法获得正文。

以下是我的尝试：

<?php

$url = 'http://figurinepop.com/mickey-paintbrush-disney-funko';
$html = file_get_contents($url);
$doc = new DOMDocument();
$doc->loadHTML($html);

$xpath = new DOMXpath($doc);

$nodes = $xpath->query('//link[@rel="canonical"]/@href'); 
foreach($nodes as $node) { 
    $canonical = $node->nodeValue; 
}
$nodes = $xpath->query('//html/body/@class'); 
foreach($nodes as $node) { 
   $bodyclass = $node->nodeValue; 
}

$output['canonical'] = $canonical;
$output['bodyclass'] = $bodyclass;

echo '<pre>'; print_r ($output); echo '</pre>';

?>

这是我得到的：

Array
(
     [canonical] => http://figurinepop.com/mickey-paintbrush-disney-funko
     [bodyclass] => 
)

它正在使用许多元素（标题，规范，div ......）但是身体类。我已经使用chrome扩展程序对Xpath查询进行了测试，看起来写得很好。

有什么问题？

如何用Xpath解析body类？

0 个答案: