Question

我想从以下这些标记中以"http://example.com/category/"开头的网页获取网址：

<td><a href="http://example.com/category/subcategory/product/257849" title="Sample Title">Test</a></td>

注意：

257849 =随机数

非常感谢任何建议。

谢谢！

Answer 1

只需在正则表达式中指定固定的基本网址asis，然后使用[\w/]+匹配字母，数字和/斜杠的任意组合：

preg_match('#http://example.com/category/[\w/]+#', $text, $match);
print $match[0];

要一次提取所有网址，请改用preg_match_all()。

Answer 2

preg_match_all('#http://example.com/category[^"]+#', $text, $a);

结果将在$ a