Question

from urllib import urlopen
import re
p = re.compile('<h2><a .*?><a .*? href="(.*?)">(.*?)</a>')
text = urlopen('http://python.org/community/jobs').read()
for url, name in p.findall(text):
    print '%s (%s)' % (name, url)

Answer 1

你的正则表达不是你想要的。试试这个：

from urllib import urlopen
import re
p = re.compile(r'<h2><a\s.*?href="(.*?)">(.*?)</a>')
text = urlopen('http://python.org/community/jobs').read()
print text
for url, name in p.findall(text):
    print '%s (%s)' % (name, url)

另外，你这样做的方式可能不是最好的主意。那就是说，我正在回答问题。：）

为什么不打印这个简单的Python程序？

1 个答案: