Question

源代码

HTML = "<title>RUU</title>"
reExtraTitle = re.compile("<title[^>]*>([^<]*)</title>", re.IGNORECASE)
mcTitle = reExtraTitle.match(HTML)
if mcTitle:
    print mcTitle.group()
else:
    print "no Title"

正则表达式帮助我

Answer 1

欢迎使用StackOverflow。人们对今天的挫败感非常顽固，我很抱歉。我猜你不是母语为英语的人，对吗？

你的问题符合SSCCE原则，虽然它表明你对研究有点了解，但你实际上并没有提出一个正确的问题，尽管你明白你所追求的是什么。你的答案在re module doc，你应该阅读。

首先需要import re，然后更改

print mcTitle.group()

到

print mcTitle.group(1)

正如其他人所暗示的那样，您或许应该考虑使用dedicated html parser代替using regexp。

标题正则表达式

1 个答案: