Question

我想在此页面上获取当前的“最大”预测值：http://www.bom.gov.au/nsw/forecasts/sydney.shtml?ref=hdr并通过Python打印该值。

我该怎么做呢？（抱歉，如果这是一个非常基本的问题，对编程来说很新！）

N.B我目前在Mac上安装了Python 3.3.0

Answer 1

使用lxml.html：

In [1]: import lxml.html as lh

In [2]: doc = lh.parse('http://www.bom.gov.au/nsw/forecasts/sydney.shtml?ref=hdr')

In [3]: doc.xpath('.//div[@class="day main"]//em[@class="max"]/text()')
Out[3]: ['21']

但是，我想你真正想做的是看看web scraping tutorial。

如何从HTML页面获取内容并在Python中打印？

1 个答案: