PHP帮助preg_match模式

时间:2011-08-05 22:35:05

标签: php preg-match design-patterns

这是我一直在努力开发的插件的核心。虽然我的模式遇到了一些麻烦..

<?php

$s = '
<embed type="application/x-shockwave-flash" id="single2" name="single2" src="http://api.realitylapse.com/player.swf" allowscriptaccess="always" allowfullscreen="true" wmode="transparent" flashvars="file=http://cerium.realitylapse.com/stream/bea352a230ebd36b52dc27d874f41f5a/4e3c5eca/default/xxxxx/xxxxx-lq.mp4&amp;plugins=ltas&amp;ltas.cc=inhldvymihzxqln&amp;provider=http" height="424" width="659">

<embed type="application/x-shockwave-flash" src="http://www.xxxxx.com/player9397/player.swf?" quality="high" allowfullscreen="true" allowscriptaccess="always" wmode="opaque" flashvars="provider=http&amp;file=http://www.xxxxx.com/player9397/vb.php?id=TT175YivmF4y&amp;type=video&amp;backcolor=111111&amp;frontcolor=cccccc&amp;lightcolor=DE4949&amp;stretching=fill" height="420" width="99%">

<embed src="http://www.megavideo.com/v/P5X0UOA267fb79acd04cdb29a057c3fa0066573a1" type="application/x-shockwave-flash" allowfullscreen="true" width="100%" height="438">    
';


$patterns = array();
$patterns[] = '<embed[^>]+src=["\'](.+?)["\']';
$patterns[] = '<embed[^>]+data=["\'](.+?)["\']';  
$patterns[] = '<embed[^>]+flashvars="(.+?)["\']'; //Possible problem..
$patterns[] = '<embed[^>]+file=(.+?)[&]';         //
$patterns[] = '<iframe[^>]+src=["\'](.+?)["\']';
$patterns[] = '<iframe[^>]+data=["\'](.+?)["\']';
$patterns[] = '<object[^>]+src=["\'](.+?)["\']';
$patterns[] = '<object[^>]+data=["\'](.+?)["\']';
$patterns[] = '<video[^>]+src=["\'](.+?)["\']';
$patterns[] = '<video[^>]+data=["\'](.+?)["\']';
$patterns[] = '<video[^>]+file=(.+?)[&]';



$patterns = "#(?:" . implode("|", $patterns) . ")#si";

preg_match_all($patterns, ($s), $m);
if (!empty($m[0]))
{ 
    $edata = array();
    foreach($m[0] as $match)
    { 

//Embeds:
if (preg_match('#realitylapse.com/stream/(.+?)[&,"\']#si', $match, $id))
     $edata[] = "<!--nextpage--><!--tab_title:CERIUM-->\n[cerium " . $id[1] . "]";

else if (preg_match('#http&amp;file=http://www.xxxx.com/player9397/vb.php?id=(.+?)[&,"\']#si', $match, $id))
     $edata[] = "<!--nextpage--><!--tab_title:UNKNOWN-->\n[vb " . $id[1] . "]";

else if (preg_match('#http://www.megavideo.com/v/(.+)[&"\']#si', $match, $id))
     $edata[] = "<!--nextpage--><!--tab_title:MEGAVIDEO-->\n[megavideo " . $id[1] . "]";

    }

if (isset($edata[0])) {

$embeds = implode("\n", ($edata)); 

print $embeds;

}
    } 

?>

仅输出:

[megavideo P5X0UOA267fb79acd04cdb29a057c3fa0066573a1]

其他所有玩家都有我的比赛。 flashvars区域中的任何内容都没有。这可能不是真正的原因。 ..像megavideo嵌入这样的东西。感谢任何帮助,谢谢!

1 个答案:

答案 0 :(得分:3)

我建议使用html解析器,而不是使用正则表达式: http://simplehtmldom.sourceforge.net/

然后,您可以轻松阅读元素属性。

$html = str_get_html('<embed id="single2" height="424" width="659" flashvars="file=http://cerium.realitylapse.com/stream/bea352a230e" >');

$embed = $html->find('embed', 1);

$embed->height; // == "424"
$embed->flashvars; // == "file=http://cerium.realitylapse.com/stream/bea352a230e"