Now, unlike the Yahoo page I worked with before, the date on this page appears in markup like this:
<div id="Scoreboard_6_National_Hockey_League">
<div class="sdi-title-page-section">National Hockey League</div>
<div class="sdi-nfl-dates">Saturday, January 19, 2013</div>
So if I want to extract that date, how would I write that in Python?
Answer 0 (score: 1)
Use BeautifulSoup to find the div by its class and read its text:
>>> from bs4 import BeautifulSoup as BS
>>> html = """<div id="Scoreboard_6_National_Hockey_League">
... <div class="sdi-title-page-section">National Hockey League</div>
... <div class="sdi-nfl-dates">Saturday, January 19, 2013</div>"""
>>> soup = BS(html, 'html.parser')  # name the parser explicitly to avoid the bs4 warning
>>> print(soup.find('div', {'class': 'sdi-nfl-dates'}).text)
Saturday, January 19, 2013
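If the goal is a real date value rather than a string, the extracted text can be parsed with the standard library. This is only a minimal sketch: the helper name parse_scoreboard_date is made up for illustration, and it assumes the page always uses the "Weekday, Month day, year" layout shown above with English names.

```python
from datetime import datetime


def parse_scoreboard_date(text):
    """Convert text like 'Saturday, January 19, 2013' into a datetime.date.

    Hypothetical helper: the format string is an assumption based on the
    single sample in the question, and it relies on English day and month
    names (the default C locale).
    """
    # %A = full weekday name, %B = full month name, %d = day, %Y = 4-digit year
    return datetime.strptime(text.strip(), "%A, %B %d, %Y").date()


parsed = parse_scoreboard_date("Saturday, January 19, 2013")
print(parsed.isoformat())  # 2013-01-19
```

In practice the argument would be the .text value returned by soup.find(...) above. strptime raises ValueError if the text does not match the format, so it may be worth catching that if the page layout varies.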