Question

我希望使用正则表达式和simplehtmldom从此页面获取文章的标题：http://laperuanavegana.wordpress.com/about/

在这种情况下标题是：CómopreprarTSITITÁN

这是我的正则表达式：

$html = file_get_html($url);
preg_match_all("title=(.*?)",$html->innertext,$title);
echo "this is title ".$title[0][0]."<br>";

如果有人帮我找到错误会很有帮助。

Answer 1

我认为您需要在<title>和</title>之间查找文字，而不是title=之后的文字。

例如：

$html = "Sometext<title>Seitan</title>More text";
preg_match_all('|<title>(.*?)</title>|',$html,$title);
echo "this is title ".$title[1][0]."<br>";