我有这个插件的两个选项。
(1)nofollow内容中的所有外部链接
和/或
(2)没有跟随此目标文件夹的链接(输入目标文件夹的绝对URL)
在选项2中,链接可以是内部或外部。
可以设置两个选项,既不设置选项,也可以设置单个选项。
if(get_option('my_nofollow') || get_option('my_nofollow_folder')){add_filter('wp_insert_post_data', 'save_my_nofollow' );}
所以当我设置其中一个选项时,我正在设置一个过滤器到下面的功能。我的问题是,如何更改函数以便设置if(2)但不设置(1)我只将nofollow添加到与目标文件夹URL匹配的链接中?
function save_my_nofollow($content) {
$my_folder = get_option('my_nofollow_folder');
$matches = array();
preg_match_all('~<a.*>~isU',$content["post_content"],$matches);
for ( $i = 0; $i <= sizeof($matches[0]); $i++){
if ( isset($matches[0][$i]) && (preg_match('~' . $my_folder . '~', $matches[0][$i])
|| !preg_match( '~'.get_bloginfo('url').'~',$matches[0][$i]))){
$result = trim($matches[0][$i],">");
$result .= ' rel="nofollow">';
$content["post_content"] = str_replace($matches[0][$i], $result, $content["post_content"]);
}
}
return $content;
}
更新后的代码和最佳答案:
if(get_option('rseo_nofollow')
|| get_option('rseo_nofollow_folder')){
add_filter('wp_insert_post_data', 'save_rseo_nofollow' );
}
function save_rseo_nofollow($content) {
$folder = get_option('rseo_nofollow_folder');
$externalNoFollow = get_option('rseo_nofollow_external');
$folderNoFollow = get_option('rseo_nofollow_folder');
$extRegex = '~'.preg_quote(get_bloginfo('url'), '~') . '~i';
$intRegex = '~'.preg_quote($folder, '~') . '~i';
$dom = new DomDocument();
libxml_use_internal_errors(true);
$dom->loadXml('<root>' . $content['post_content'] . '</root>');
$links = $dom->getElementsByTagName('a');
foreach ($links as $link) {
$href = $link->getAttribute('href');
if ($href && $externalNoFollow && !preg_match($extRegex, $href)) {
$link->setAttribute('rel', 'nofollow');
} elseif ($href && $folderNoFollow && preg_match($intRegex, $href)) {
$link->setAttribute('rel', 'nofollow');
}
}
// print $dom->saveXml();die;
//Since we want to strip the root element, we must do so:
$newContent = '';
$root = $dom->getElementsByTagName('root')->item(0);
foreach ($root->childNodes as $child) {
$newContent .= $dom->saveXml($child);
}
$content['post_content'] = $newContent;
return $content;
}
输入
This is the <a href="http://cnn.com">test</a>. This is the test.
输出
This is the <a rel="nofollow" href=""http://cnn.com"">test</a>. This is the test.
答案 0 :(得分:1)
不要使用正则表达式解析HTML。这不是一个好主意......而是使用Dom功能。请注意,您可能需要将内容包装在外部根标记中(我在此处添加了<root>
)(。
$externalNoFollow = get_option('my_nofollow_external');
$folderNoFollow = get_option('my_nofollow_folder');
$extRegex = '~'.preg_quote(get_bloginfo('url'), '~') . '~i';
$intRegex = '~'.preg_quote($folder, '~') . '~i';
$dom = new DomDocument();
libxml_use_internal_errors(true);
if (!$dom->loadHtml('<html><body>' . $content['post_content'] . '</body></html>')) {
/** Error out, since the loading failed.
Make sure `$content['post_content']` is valid html
**/
die('Invalid HTML detected');
}
$links = $dom->getElementsByTagName('a');
foreach ($links as $link) {
$href = $link->getAttribute('href');
if ($href && $externalNoFollow && !preg_match($extRegex, $href)) {
$link->setAttribute('rel', 'nofollow');
} elseif ($href && $folderNoFollow && preg_match($intRegex, $href)) {
$link->setAttribute('rel', 'nofollow');
}
}
//Since we want to strip the root element, we must do so:
$newContent = '';
$root = $dom->getElementsByTagName('body')->item(0);
foreach ($root->childNodes as $child) {
$newContent .= $dom->saveXml($child);
}
$content['post_content'] = $newContent;
return $content;
注意,您应该添加无效HTML的实际错误处理...