无法使用PHP Simple HTML DOM Parser的find()函数

时间:2017-01-05 10:48:10

标签: php dom simple-html-dom

我正在尝试抓取远程网站并编辑部分结果,然后更新数据库中的几个表,然后 echo()查看最终文档。

以下是有问题的代码的编辑片段供参考:

<?php

require_once 'backend/connector.php';
require_once 'table_access/simplehtmldom_1_5/simple_html_dom.php';
require_once 'pronunciation1.php';

// retrieve lookup term
if(isset($_POST["lookup_term"])){ $term = trim($_POST["lookup_term"]); }
else { $term = "hombre"; }

$html = file_get_html("http://www.somesite.com/translate/" . rawurlencode($term));
$coll_temp = $html->find('div[id=translate-en]');
$announce = $coll_temp[0]->find('.announcement');
$quickdef = $coll_temp[0]->find('.quickdef');
$meaning = $announce[0] . $quickdef[0];

$html->clear(); // release scraper variable to prevent memory leak issues
unset($html); // release scraper variable to prevent memory leak issues
$meaning = '<?xml version="1.0" encoding="ISO-8859-15"?>' . $meaning;

// process the newly-created DOM
$dom = new DOMDocument;
$dom->loadHTML($meaning);
// various DOM-manipulation code snippets

// extract the quick definition section
foreach ($dom->find('div[class=quickdef]') as $qdd) {
    $qdh1 = $qdd->find('.source')[0]->find('h1.source-text');
    $qdterm = $qdh1[0]->plaintext;
    $qdlang = $qdh1[0]->getAttribute('source-lang');
    add2qd($qdterm, $qdd, $qdlang);
    unset($qdterm);
    unset($qdlang);
    unset($qdh1);
}

$finalmeaning = $dom->saveHTML(); // store processed DOM in $finalmeaning
push2db($term, $finalmeaning); // add processed DOM to database
echo $finalmeaning; // output processed DOM

// release variables
unset($dom);
unset($html);
unset($finalmeaning);
function add2qd($lookupterm, $finalqd, $lang){
    $connect = dbconn(PROJHOST, CONTEXTDB, PEPPYUSR, PEPPYPWD);
    $sql = 'INSERT IGNORE INTO tblquickdef (word, quickdef, lang) VALUES (:word, :quickdef, :lang)';
    $query = $connect->prepare($sql);
    $query->bindParam(':word', $lookupterm);
    $query->bindParam(':quickdef', $finalqd);
    $query->bindParam(':lang', $lang);
    $query->execute();
    $connect = null;
}
function push2db($lookupword, $finalmean) {
    $connect = dbconn(PROJHOST, DICTDB, PEPPYUSR, PEPPYPWD);
    $sql = 'INSERT IGNORE INTO tbldict (word, mean) VALUES (:word, :mean)';
    $query = $connect->prepare($sql);
    $query->bindParam(':word', $lookupword);
    $query->bindParam(':mean', $finalmean);
    $query->execute();
    $connect = null;
}

?>

//提取快速定义部分下的 for 循环外,代码工作正常。从此循环内部调用的函数是 add2qd(),它接受3个字符串值作为输入。

每次循环运行时,PHP都会抛出致命错误,因为它认为 find()未定义。我知道find是PHP Simple HTML DOM Parser库中的合法函数,因为我在同一代码中多次使用它而没有任何问题(在 // retrieve lookup term 部分中)。我做错了什么?

1 个答案:

答案 0 :(得分:0)

但是你没有使用PHP Simple HTML DOM - 只有标准的PHP DOMDocument,它没有找到方法。

$dom = new DOMDocument;
$dom->loadHTML($meaning);

foreach ($dom->find('div[class=quickdef]') as $qdd) {

http://php.net/manual/en/class.domdocument.php