Question

给出以下HTML字符串，由lxml解析：

<strong class="footer"> 
  <span class="icon-new"><i class="icon-new"/></span> 
  16 
</strong>

如何使用XPath提取数字 16 ？

Answer 1

as＆＃39; 16＆＃39;文字位于strong标记下，您可以这样做：

>>> root = etree.fromstring(html)
>>> root.xpath('//strong[@class="footer"]/text()[normalize-space()]')[0].strip()
'16'

Answer 2

使用纯Xpath

//strong[@class='footer']/normalize-space(text()[position()=last()])