Question

我有一个包含以下内容的html页面：

<a href="coord/ag13.dat">ag13.dat</a>\ Drela AG13 airfoil \<a href="afplots/ag13.gif">ag13.gif</a>\ \ \ <a href="ref/misc_refs.html#3">Ref [3]</a><br>

我需要提取文字“\ Drela AG13翼型”。我怎样才能做到这一点？到目前为止，我已设法提取可能对我有用的链接内容

    include "simple_html_dom.php";

    $html = file_get_html('http://aerospace.illinois.edu/m-selig/ads/coord_database.html');
    foreach($html->find('a[href*=coord\/]') as $element) 
    echo $element->href. ;

p.s：我从未参与过这样的项目，所以这对我来说都是新的。

Answer 1

使用innertext，如下所示：

foreach($html->find('a[href*=coord\/]') as $element) 
    echo $element->innertext. '<br>';

dom解析器从非语义html页面中提取链接和特定文本（没有id或类等）

1 个答案: