Here is the HTML shown on my website:
<meta content="auth" name="param" />
<meta content="I_WANT_THIS" name="token" />
How can I grab the I_WANT_THIS value using lxml.html?
Answer 0 (score: 2)
Use xpath to find the meta tag by its name attribute and get the value of its content attribute:
from lxml.html import fromstring
html_data = """ <meta content="auth" name="param" />
<meta content="I_WANT_THIS" name="token" />"""
tree = fromstring(html_data)
print(tree.xpath('//meta[@name="token"]/@content'))
Prints:
['I_WANT_THIS']
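Note that xpath returns a list, so if you want the token itself you still have to take the first item. A minimal sketch of one way to wrap that up, assuming you want None when no matching tag exists; get_meta_content is a hypothetical helper name, not part of lxml:

```python
from lxml.html import fromstring


def get_meta_content(html_data, name):
    """Return the content of the first <meta> with the given name, or None.

    get_meta_content is a hypothetical helper, not an lxml function.
    """
    tree = fromstring(html_data)
    # Pass the name as an XPath variable so it needs no manual quoting.
    values = tree.xpath('//meta[@name=$name]/@content', name=name)
    # xpath() returns a list; it is empty when nothing matches.
    return values[0] if values else None


html_data = """<meta content="auth" name="param" />
<meta content="I_WANT_THIS" name="token" />"""

token = get_meta_content(html_data, "token")
print(token)
```

With the sample HTML from the question this should print I_WANT_THIS as a plain string rather than a one-item list, and a name that is not present gives None instead of raising an IndexError.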