Question

我正在尝试使用simplehtmldom编写一个Web scraper。我想通过搜索标签的内容来获取标签。这是它里面的明文，而不是标签的类型。然后，一旦我通过搜索其纯文本的内容获得标记，我想在此之后获得下一个标记。

如何根据内容找到标签？一旦我拥有它，我如何找到以下标签？

任何帮助都将不胜感激。

感谢。

Answer 1

以下内容可让您搜索所有文本节点，然后获取下一个标记：

// Use Simple_HTML_DOM special selector 'text'
// to retrieve all text nodes from the document
$textNodes = $html->find('text');
$foundTag = null;

foreach($textNodes as $textNode) {
    if($textNode->plaintext == 'Hello World') {
        // Get the parent of the text node
        // (A text node is always a child of
        //  its container)
        $foundTag = $textNode->parent();
        break;
    }
}

if($foundTag) {
    $nextTagAfter = $foundTag->next_sibling();
}

这不是您关于基本 Simple_HTML_DOM 用法的第一个问题。您可能需要read the official documentation。

你如何在simplehtmldom中搜索标签的内容？

1 个答案: