Question

我有以下正则表达式：

>>> re.findall('http://www.rottentomatoes.com/.+', html)
['http://www.rottentomatoes.com/m/1129132-torque" class="see-all">Read More About This Movie On Rotten Tomatoes</a>']

如何在"之前将其匹配。我想让回报成为：

http://www.rottentomatoes.com/m/1129132-torque

Answer 1

使用非贪婪量词?停在第一个"：

>>> html = 'http://www.rottentomatoes.com/m/1129132-torque" class="see-all">Read More About This Movie On Rotten Tomatoes</a>'
>>> re.search('(http://www\.rottentomatoes\.com/.+?)"', html).group(1)
'http://www.rottentomatoes.com/m/1129132-torque'

Answer 2

只需在要停止的位置添加字符（“）。同时添加?，以便在第一场比赛时停止。

>>> html='http://www.rottentomatoes.com/m/1129132-torque" class="see-all">Read More About This Movie On Rotten Tomatoes</a>'
>>> re.findall('http://www.rottentomatoes.com/.+?\"', html)
['http://www.rottentomatoes.com/m/1129132-torque"']

改进正则表达式以结束报价

2 个答案: