Question

我正在学习网页搜刮。我想知道我们如何从下面的元素中获取参与者计数？

<li class="header-hero__stat header-hero__stat--participants">
   ::before
   "255,590 Participants"
   ::after
</li>

我尝试过的代码

soupy = bs(html,'lxml') 
ul = soupy.find('li',{'class':"header-hero__stats"})

返回None

Answer 1

这不是伪元素的内容，而是li节点的文本内容，所以

li = soup.find('li',{'class':"header-hero__stat--participants"}).text

应该足以提取'255,601 Participants'

使用.text.split()[0]仅获取数字