How do you reference a DIV in Python (HTML parsing)?

Asked: 2013-08-22 01:54:48

Tags: python python-3.x html-parsing beautifulsoup

Here is the website: http://scores.suntimes.com/sports-scores/Hockey-Scores-Matchups.aspx?Year=2013&Period=0119&CurrentSeason=2012-2013

Unlike the Yahoo page I worked with earlier, the date on this page appears in markup like this:

<div id="Scoreboard_6_National_Hockey_League">
         <div class="sdi-title-page-section">National Hockey League</div>
         <div class="sdi-nfl-dates">Saturday, January 19, 2013</div>

So if I want to extract that date, how would I express that in Python?

1 answer:

Answer 0: (score: 1)

Use BeautifulSoup to find the div by its class and read its text:

>>> from bs4 import BeautifulSoup as BS
>>> html = """<div id="Scoreboard_6_National_Hockey_League">
...          <div class="sdi-title-page-section">National Hockey League</div>
...          <div class="sdi-nfl-dates">Saturday, January 19, 2013</div>"""
>>> soup = BS(html, 'html.parser')
>>> print(soup.find('div', {'class': 'sdi-nfl-dates'}).text)
Saturday, January 19, 2013
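
If installing BeautifulSoup is not an option, roughly the same lookup can be sketched with only the standard library. This is not part of the original answer. The class name `DateDivParser` is hypothetical, and the format string passed to `strptime` assumes the date always looks like the sample above and that the process runs with English day and month names.

```python
from datetime import datetime
from html.parser import HTMLParser


class DateDivParser(HTMLParser):
    """Hypothetical helper: collects the text of each <div> carrying target_class."""

    def __init__(self, target_class):
        super().__init__()
        self.target_class = target_class
        self.depth = 0   # greater than 0 while inside a matching <div>
        self.dates = []  # one text entry per matching <div>

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        if self.depth:
            # A nested <div> inside a matching one; track it so the
            # closing tags are balanced correctly.
            self.depth += 1
        elif self.target_class in (dict(attrs).get("class") or "").split():
            self.depth = 1
            self.dates.append("")

    def handle_endtag(self, tag):
        if tag == "div" and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.dates[-1] += data


html = """<div id="Scoreboard_6_National_Hockey_League">
         <div class="sdi-title-page-section">National Hockey League</div>
         <div class="sdi-nfl-dates">Saturday, January 19, 2013</div>"""

parser = DateDivParser("sdi-nfl-dates")
parser.feed(html)
parser.close()

date_text = parser.dates[0].strip()
# Assumed format: "Weekday, Month day, year" with English names.
game_date = datetime.strptime(date_text, "%A, %B %d, %Y").date()

print(date_text)
print(game_date.isoformat())
```

Converting the text to a `date` object is optional, but it makes it easier to compare or sort scoreboard dates than working with the raw string.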