DOMDocument::loadHTML(): Empty string supplied as input

Date: 2016-10-27 23:20:49

Tags: php

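I have searched countless pages trying to find an answer that actually works. I have tried libraries dedicated to warning and error handling, but even when I suppress all warnings and errors, this one final warning still shows: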

Warning: DOMDocument::loadHTML(): Empty string supplied as input

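My PHP processing is shown below. The code works perfectly as long as the user enters an actual URL, but when the user enters something that is not a URL, the warning above is displayed.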

if (isset($_GET['article_url'])){
    $title = 'contact us';
    $str = @file_get_contents($_GET['article_url']);
    $test1 = str_word_count(strip_tags(strtolower($str)));
    if($test1 === FALSE) { $test1 = 0; }
    if ($test1 > '550') {
        echo '<div><i class="fa fa-check-square-o" style="color:green"></i> This article has '.$test1.' words.</div>';
    } else {
        echo '<div><i class="fa fa-times-circle-o" style="color:red"></i> This article has '.$test1.' words. You are required to have a minimum of 500 words.</div>';
    }

    $document = new DOMDocument();
    $libxml_previous_state = libxml_use_internal_errors(true);
    $document->loadHTML($str);
    libxml_use_internal_errors($libxml_previous_state);

    $tags = array ('h1', 'h2');
    $texts = array ();

    foreach($tags as $tag)
    {
        $elementList = $document->getElementsByTagName($tag);
        foreach($elementList as $element)
        {
            $texts[$element->tagName] = strtolower($element->textContent);
        }
    }

    if(in_array(strtolower($title),$texts)) {
        echo '<div><i class="fa fa-check-square-o" style="color:green"></i> This article used the correct title tag.</div>';
    } else {
    echo "no";
    }
}

How can I suppress this warning?

The advice seems to be to stop suppressing warnings and fix them instead. When I stop suppressing them, I get this list of warnings:

Warning: DOMDocument::loadHTML(): htmlParseEntityRef: expecting ';' in Entity
Warning: DOMDocument::loadHTML(): htmlParseStartTag: misplaced <body> tag in Entity
Warning: DOMDocument::loadHTML(): Tag header invalid in Entity
Warning: DOMDocument::loadHTML(): Tag section invalid in Entity
Warning: DOMDocument::loadHTML(): error parsing attribute name in Entity
Warning: DOMDocument::loadHTML(): Tag footer invalid in Entity
Warning: DOMDocument::loadHTML(): htmlParseEntityRef: no name in Entity
DOMDocument::loadHTML(): Unexpected end tag : strong in Entity

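Keep in mind that I am scanning URLs entered by users, so I have no control over the markup of the pages being tested, which means I cannot fix their code.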

So what should I do, if not suppress the warnings?
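For the malformed-markup warnings listed above, one option is to let libxml collect the parse errors in its internal buffer instead of emitting them as PHP warnings, and then inspect or discard them. The sketch below is only an illustration: `parseHtmlQuietly()` is a hypothetical helper name, the sample markup is made up, and the helper assumes the input is a non-empty string (the empty-input case is a separate problem).

```php
<?php
// Sketch: collect libxml parse errors instead of emitting PHP warnings.
// parseHtmlQuietly() is a hypothetical helper name, not part of PHP.
// Assumes $html is a non-empty string.
function parseHtmlQuietly(string $html): array
{
    $document = new DOMDocument();

    // Route libxml errors into an internal buffer and remember the old setting.
    $previous = libxml_use_internal_errors(true);
    $document->loadHTML($html);

    // Copy out the collected messages, then empty the buffer.
    $messages = [];
    foreach (libxml_get_errors() as $error) {
        $messages[] = trim($error->message);
    }
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    return [$document, $messages];
}

// Made-up third-party markup with a bare "&" and a stray closing tag.
[$document, $messages] = parseHtmlQuietly('<h1>Contact Us</h1><p>A & B</strong></p>');
echo $document->getElementsByTagName('h1')->item(0)->textContent, "\n";
echo count($messages), " parse problem(s) recorded\n";
```

The document is still usable after a tolerant parse like this, so the heading check can run even though the page's markup cannot be fixed.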

2 Answers:

Answer 0 (score: 3)

OK @Bruce, I understand the problem now. What you want to do is test the value returned by file_get_contents():

<?php
error_reporting(-1);
ini_set("display_errors", 1);

$article_url = 'http://google.com';
if (isset($article_url)){
  $title = 'contact us';
  $str = @file_get_contents($article_url);
  // return an error
  if ($str === FALSE) {
    echo 'problem getting url';
    return false;
  }

  // Continue
  $test1 = str_word_count(strip_tags(strtolower($str)));
  if ($test1 === FALSE) $test1 = 0;

  if ($test1 > '550') {
    echo '<div><i class="fa fa-check-square-o" style="color:green"></i> This article has ' . $test1 . ' words.</div>';
  } else {
    echo '<div><i class="fa fa-times-circle-o" style="color:red"></i> This article has ' . $test1 . ' words. You are required to have a minimum of 500 words.</div>';
  }

  $document = new DOMDocument();
  $libxml_previous_state = libxml_use_internal_errors(true);
  $document->loadHTML($str);
  libxml_use_internal_errors($libxml_previous_state);

  $tags = array ('h1', 'h2');
  $texts = array ();

  foreach($tags as $tag) {
    $elementList = $document->getElementsByTagName($tag);
    foreach($elementList as $element) {
      $texts[$element->tagName] = strtolower($element->textContent);
    }
  }

  if (in_array(strtolower($title),$texts)) {
    echo '<div><i class="fa fa-check-square-o" style="color:green"></i> This article used the correct title tag.</div>';
  } else {
    echo "no";
  }
}
?>

So check if ($str === FALSE) { // return an error } and do not let the script continue. You can return false as I am doing here, or just use an if/else.
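The same guard can be pulled into a small function that also rejects input that is not a URL and responses that come back empty, so that loadHTML() never receives false or an empty string. This is a minimal sketch, not a definitive implementation: `fetchHtml()` is a hypothetical helper name, and restricting input to http/https is an added assumption. As far as I know, PHP 8 raises a ValueError rather than a warning when loadHTML() is given an empty string, which makes the guard more important there.

```php
<?php
// Sketch: fetch a page only if the input looks like an http(s) URL, and
// return null instead of handing false or "" to loadHTML().
// fetchHtml() is a hypothetical helper name, not part of PHP.
function fetchHtml($input): ?string
{
    if (!is_string($input) || filter_var($input, FILTER_VALIDATE_URL) === false) {
        return null; // not a URL at all
    }

    // Assumption: only web URLs are wanted, so file://, php:// etc. are refused.
    $scheme = strtolower((string) parse_url($input, PHP_URL_SCHEME));
    if ($scheme !== 'http' && $scheme !== 'https') {
        return null;
    }

    $html = @file_get_contents($input);
    if ($html === false || trim($html) === '') {
        return null; // request failed or the body was empty
    }

    return $html;
}

$html = fetchHtml($_GET['article_url'] ?? null);
if ($html === null) {
    echo 'Please enter a valid article URL.';
} else {
    $document = new DOMDocument();
    $previous = libxml_use_internal_errors(true);
    $document->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);
    // ... continue with the word count and heading checks ...
}
```

Returning null for every failure keeps the calling code to a single check. If the page should tell the user whether the URL was invalid or the request failed, the function could return distinct error values instead.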

Answer 1 (score: 0)

return $this->subject($data['subject'])->markdown('mails.send_instant_notification', compact('data'));

If you build the template with markdown, do not include inline styles. If you use a view instead, you can include inline CSS styles.

return $this->subject($data['subject'])
                ->view('mails.send_instant_notification', compact('data'));