应用错误收集

我在使用lxml抓取数据方面遇到了一些困难。

代码段是：

from lxml import etree

parser = etree.HTMLParser()
f = open("test1.html", "r")
content = f.read()
tree   = etree.fromstring(content, parser)
print tree.xpath('//table[@id="table_search_results"]/tbody/tr')

我的html片段是：

<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" class=" gecko ff3 linux js gecko ff3 linux js">
 ............
 ............

 <table cellspacing="0" cellpadding="0" border="0" height="100" id="table_search_results"><tbody><tr></tr></tbody></table> </html>

但我无法从我的python代码中获取表数据。 print语句显示为null，而显然我的表元素具有给定的id。

请帮忙。

使用Python和LXML刮擦数据

0 个答案: