如何获取媒体:使用SimpleXML的内容

时间:2012-07-25 11:11:09

标签: php parsing simplexml

我有一个XML Feed,我可以在新闻网站上获取有关塞浦路斯的新闻。 我想使用新闻图片,当然还有新闻本身。

以下是xml的示例

 <item>
      <title>Rumlardan KKTC üniversitelerine ambargo</title>
      <description>
        <![CDATA[<p><a href="http://www.ntvmsnbc.com/id/25368624/">
          <img align="left" border="0" src="http://media.ntvmsnbc.com/j/NTVMSNBC/Components/ArtAndPhoto-Fronts/Sections-StoryLevel/Dünya/Kıbrıs/120723girne.thumb.jpg" alt="" style="margin:0 5px 5px 0" /></a> 
          Kıbrıs Rum yönetimi, KKTC'deki üniversitelerle işbirliği yapan ülkelerin eğitim kurumlarına resmi yazılar göndererek yetkilileri tehdit ediyor. Hindistan üniversitesinden ortak akademik programlara son verilmesi istendi.</p><br clear="all" />]]>
      </description>
      <link>http://www.ntvmsnbc.com/id/25368624/</link>
      <media:content medium="image" url="http://media.ntvmsnbc.com/j/NTVMSNBC/Components/ArtAndPhoto-Fronts/Sections-StoryLevel/Dünya/Kıbrıs/120723girne.thumb.jpg">
        <media:text>
          <![CDATA[<p><a href="http://www.ntvmsnbc.com/id/25368624/">
            <img align="left" border="0" src="http://media.ntvmsnbc.com/j/NTVMSNBC/Components/ArtAndPhoto-Fronts/Sections-StoryLevel/Dünya/Kıbrıs/120723girne.thumb.jpg" alt="" style="margin:0 5px 5px 0" />
            </a>
            </p>
            <br clear="all" />]]>
        </media:text>
      </media:content>
      <pubDate>Mon, 23 Jul 2012 14:27:32 GMT</pubDate>
      <category>Haberler</category>
      <guid isPermaLink="true">http://www.ntvmsnbc.com/id/25368624/</guid>
    </item>

xml的链接是:http://www.ntvmsnbc.com/id/24928068/device/rss/rss.xml

现在我用simpleXML解析这个xml并使用下面的代码。在此代码上,我可以阻止使用strip_标签显示<img>标签,我只能在CDATA中显示文本数据。

简而言之,我要问的是如何将media:content url=""放入我的代码中,因为我想将thumb.jpg更改为hlarge.jpg。

我尝试了$d['media'] = $news->media->attributes(); '<p>'.$post['media']['url'].,但它无效

这是我的代码:

<? 
$NewsFeedUrl =  "http://www.ntvmsnbc.com/id/24928068/device/rss/rss.xml";

$xml = @simplexml_load_file($NewsFeedUrl);

if(is_object($xml)){
    //Rest of our code will be here
}else{
    die('Güncel Haberlere Bağlanılamıyor.');
}

foreach($xml->channel->item as $news){
    if(is_array($newsContent) && count($newsContent)==$amountToShow){
    }
    $description = $news->description;
    $d['title'] =$news->title;
    $d['link'] = $news->link;
    $d['media'] = $news->media->attributes();
    $d['cont'] = $news->description;
    $d['date'] = $news->pubDate;
    $newsContent[]=$d;
}
//$ad=array("thumb", "left");

if(is_array($newsContent)){
    foreach($newsContent as $post){

        echo '

      <article class="entry"><h3>'.'<a href="'.$post['link'].' "target="_blank">'.$post['title'].'</a></h3>
      '.'<div class="meta"><span class="date_post">'.$post['date'].'</span>'.$post['pubDate'].
      //'<p>'.str_replace($ad,"hlarge",$post['cont']).
      '<p>'.$post['cont'].
      '<p>'.strip_tags($post['cont']).
      //'<p>'.$post['media']['url'].
      '<p><a href="'.$post['link'].' "target="_blank" class="button">Devamını Oku</a>'.
      ' </article>';
    }
}else{
    echo '<p>Güncel Haberler Alınamadı Sayfayı Yenilemeyi Deneyin.</p>';
}

?>

2 个答案:

答案 0 :(得分:3)

content元素具有名称空间前缀(<media:content>),因此无法通过常规方式访问它。

“媒体”的名称空间URI来自http://search.yahoo.com/mrss/(请检查rss.xml是否为“xmlns:media”)。

试试这个:

foreach ($xml->channel->item as $news)
{
    $ns_media = $news->children('http://search.yahoo.com/mrss/');

    echo $ns_media->content; // displays "<media:content>"
}

编辑:

我认为名称空间uri“http://search.yahoo.com/mrss/

存在一些问题

我尝试使用你的 xml:

http://codepad.org/P90bOQUj [不工作]

我尝试使用其他 xml:

http://codepad.org/ADYveL6T [工作]

答案 1 :(得分:0)

我也有这个问题,我使用以下解决方案进行了测试:

https://gist.github.com/enderandpeter/6b760140bf9d2ed9620c

成功了!

$content = $xml->channel->item->children('media', true)->content;

$contentattr = $content->attributes();