如何从rss URL获取所有内容

时间:2013-11-26 06:15:55

标签: php database wordpress rss wordpress-plugin

我正在开发一个自动系统,其中一个cron作业自动从RSS提要中获取新鲜内容,然后将它们存储到数据库供以后使用......(可以用作WP帖子)...

一切都很好,但唯一的问题是我只得到了一些小描述。我想从rss Feed中提取完整的帖子内容?不只是摘录。

我正在使用WordPress Not Codeigniter

我使用的PHP代码

RSSLink = http://feeds.feedburner.com/learnhack

 $rss = fetch_feed($entry->rss_link);
   foreach ($rss->get_items() as $item)
    {
        var_dump($item);
        $page_content = array(
            'post_title' => $item->get_title(),
            'post_content' => $item->get_description(),
            );
        // Data base insert statements
     }

OutPut:

SimplePie_Item对象([feed] => SimplePie对象([data] =>数组([child] =>数组([] =>数组([rss] =>数组([0] = >数组([data] => [attribs] =>数组([] =>数组([版本] => 2.0))[xml_base] => [xml_base_explicit] => [xml_lang] = > [child] =>数组([] =>数组([channel] =>数组([0] =>数组([数据] => [attribs] =>数组()[xml_base] ] => [xml_base_explicit] => [xml_lang] => [child] =>数组([] =>数组([title] =>数组([0] =>数组([数据] =>了解道德黑客的基础知识[attribs] => Array()[xml_base] => [xml_base_explicit] => [xml_lang] =>))[link] =>数组([0] = >数组([data] => http://www.basicsofhacking.com/ [attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang] =>))[description] =>数组([0] =>数组([数据] =>了解道德黑客技术:什么是黑客,电子邮件黑客,系统黑客,网站黑客攻击,Facebook黑客攻击,谷歌黑客攻击等[attribs] => Array()[xml_base] => [xml_base_explicit] => [xml_lang] => ))[语言] =>数组([0] =>数组([数据] => en [attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang] =>))[managingEditor] = >数组([0] =>数组([数据] => noreply@blogger.com(Harwinder Kumar)[attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang] = >))[lastBuildDate] =>数组([0] =>数组([数据] =>太阳,2013年11月24日08:25:03 PST [attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang ] =>))[generator] =>数组([0] =>数组([数据] => Blogger http://www.blogger.com [attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang] =>) )[image] =>数组([0] =>数组([数据] => [attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang] => [child] =>数组([] =>数组([link] =>数组([0] =>数组([数据] => http://creativecommons.org/licenses/by/3.0/ [attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang] =>))[url] =>数组([0] =>数组([数据] => http://creativecommons.org/images/public/somerights20.gif [attribs] =>数组() [xml_base] => [xml_base_explicit] => [xml_lang] =>))[title] =>数组([0] =>数组([数据] =>保留一些权利[attribs] => ; Array()[xml_base] => [xml_base_explicit] => [xml_lang] =>))))))[item] =>数组([0] =>数组([数据] => [attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang] => [child] =>数组([] =>数组([title] =>数组([0] =>数组([数据] => WordPress安全:保护黑客/未来攻击网站[attribs] =>数组()[ xml_base] => [xml_base_explicit] => [xml_lang] =>))[link] =>数组([0] =>数组([数据] => http://feedproxy.google.com/~r/learnhack/~3/nSMFsPWxWQQ/wordpress-security-securing-sites-from-hackers.html [attribs] = > Array()[xml_base] => [xml_base_explicit] => [xml_lang] =>))[category] ​​=>数组([0] =>数组([数据] =>道德基础HACKING [attribs] => Array()[xml_base] => [xml_base_explicit] => [xml_lang] =>)[1] =>数组([data] => WORDPRESS TRICKS [attribs] =&gt ; Array()[xml_base] => [xml_base_explicit] => [xml_lang] =>))[作者] =>数组([0] =>数组([数据] => noreply @ blogger。 com(Harwinder Kumar)[attribs] => Array()[xml_base] => [xml_base_explicit] => [xml_lang] => ))[pubDate] =>数组([0] =>数组([数据] =>星期三,2013年10月30日09:59:56 PDT [attribs] =>数组()[xml_base] => [xml_base_explicit] => [xml_lang ] =>))[guid] =>数组([0] =>数组([数据] =>标记:blogger.com,1999:blog-8198217290464183069.post-3028513166465596533 [attribs] =>数组([] =>数组([isPermaLink] = > false))[xml_base] => [xml_base_explicit] => [xml_lang] =>))[description] =>数组([0] =>数组([数据] =>由于WordPress是网络上最受欢迎的CMS,但如果我们不遵守必要的安全措施,也容易受到威胁。帖子,Sarah Rexman提到了一些关于保护WordPress的技巧,在这篇文章中,我将分享我自己的经验。在oDesk,Elance和Freelancer担任自由职业者;客户总是有关于从黑客保护他们的网站的问题,并询问如何防止...

[attribs] => Array()[xml_base] => [xml_base_explicit] => [xml_lang] => )))[http://search.yahoo.com/mrss/] =>数组([缩略图] =>数组([0] =>数组([数据] => [attribs] =>数组([] =>数组([url] => http://2.bp.blogspot.com/-meTNpj8B758/UOR5j1OmE5I/AAAAAAAAAvk/UtCMCLa_C3Q/s72-c/WordPress+Security.jpg [height] => 72 [宽度] => 72))

此外,我想将RSS帖子图像存储在我的服务器上,没有热链接

1 个答案:

答案 0 :(得分:1)

RSS源的描述仅包含作者决定放入的内容。这可能是一篇完整的文章,但它也可能只是对某些内容的总结。

如果您需要完整的文章,我认为您几乎无法通过link元素中的网址获取任何内容。

对于图像和其他媒体文件,您可能需要解析HTML,并手动下载每个元素。更不用说重写所有路径......祝你好运...


此外,如果您正在将自己的博客写入自己的willy-nilly博客,那么请停止...