Question

我从HTMl内容中获取单独的所有URL链接使用此代码

$doc = new DOMDocument();
$doc->loadHTML($string);
$anchorTags = $doc->getElementsByTagName('a');
$links = array();
foreach ($anchorTags as $url) {
    $source = parse_url($url->getAttribute('href'));
    $source = preg_replace('/^www\./', '', $source['host']);
    $links[$source][$url->getAttribute('href')] = $url->nodeValue;
}

使用以上代码输出。

Array
(
    [Facebook] => Array
                (
                    [facebook.com] => https://www.facebook.com/
                )

    [Google] => Array
                (
                    [google.com] => https://www.google.com/
                )

    [] => Array
        (
            [] =>
         )

    [yahoo] => Array
            (
                [yahoo.com] => https://www.yahoo.com/
            )

)

我只想从数组中删除null / blank元素/ index / key 为此，我使用array_filter();

但没有得到解决方案。

print_r(array_filter($links));

Answer 1

只需添加检查值的条件：

$links = array();
foreach ($anchorTags as $url) {
    $source = parse_url($url->getAttribute('href'));
    $source = preg_replace('/^www\./', '', $source['host']);
    if($source != null && $source != "" && $url->nodeValue != null && $url->nodeValue != ""){
         $links[$source][$url->getAttribute('href')] = $url->nodeValue;
    }
}

Answer 2

或者，更优雅一点，如果它是空的，甚至不会将结果推送到你的数组：

if ($source != "") $links[$source][$url->getAttribute('href')] = $url->nodeValue;

Answer 3

你可以试试这个，

    // Remove empty elements
foreach($links as $key => $val){
    if($val == '')
    {
        unset($val);
    }
}

Answer 4

你可以查看strlen，就像这样

print_r(array_filter($links, 'strlen' ));

从数组中删除null / blank元素/ index / key

4 个答案: