Question

我有这段代码：

url = "http://www.padtube.com/Audio-Music-Editor/10-75359.html"
pageurl = urllib.urlopen(url)
soup = BeautifulSoup(pageurl)

for table in soup.select("table#product-quickfacts-table"):
    print table.find('meta',{'itemprop':'datePublished'})

当我运行此代码时，它会给我输出结果：

<meta content="2012-03-01T00:00:00-05:00" itemprop="datePublished"/>

我如何只拍摄日期？

Answer 1

你的行

print table.find('meta',{'itemprop':'datePublished'})

<meta content="2012-03-01T00:00:00-05:00" itemprop="datePublished"/>

返回包含属性itemprop='datePublished'的元素元。您只想访问此xml元素的content节点

print table.find('meta',{'itemprop':'datePublished'})['content']

2012-03-01T00:00:00-05:00

如何从网页上获取发布日期？

1 个答案: