Question

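I want to extract all the strings contained in strong tags and save them to a file. However, I can't extract the strong tags; find_all only returns an empty array. What am I missing? Here is the code I used. Input: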

>>> import requests
>>> from bs4 import BeautifulSoup
>>> r = requests.get('http://www.webmd.com/heart-disease/heart-disease-glossary#1')
>>> r.content
>>> soup = BeautifulSoup(r.content, "lxml")
>>> print(soup)
>>> soup.find_all('strong')

Output of the last statement:

[]

Thanks,

Answer 1

Actually, there are no strong tags in the soup.

When you open the URL in a browser, you may find two strong tags, but I don't think they are the ones you need.

<li xmlns="http://www.w3.org/1999/xhtml">Check the address for typing errors such as <strong>ww</strong>.example.com instead of <strong>www</strong>.example.com</li>

If you want all the terms that are shown in bold, you should use .find_all('b') instead.

[tag.text.strip().replace(':',"") for tag in soup.find_all('b')]

Output:

['Ablation','Advance Directive (living will)','Aerobic Exercise', ...]
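
Since the question also asks about saving the strings to a file, here is a minimal sketch of the whole flow. It runs on a small inline HTML snippet rather than the live page, so SAMPLE_HTML, the helper names extract_bold_terms and save_terms, and the terms.txt filename are all assumptions made for illustration; the real page markup may differ. It uses the built-in "html.parser" so that lxml is not required.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the glossary markup: the terms sit in <b> tags,
# and there are no <strong> tags at all.
SAMPLE_HTML = """
<ul>
  <li><b>Ablation:</b> Removal of tissue.</li>
  <li><b>Aerobic Exercise:</b> Activity that raises the heart rate.</li>
</ul>
"""


def extract_bold_terms(html):
    """Return the text of every <b> tag, stripped and without colons."""
    soup = BeautifulSoup(html, "html.parser")
    return [tag.text.strip().replace(":", "") for tag in soup.find_all("b")]


def save_terms(terms, path):
    """Write one term per line to the given file."""
    with open(path, "w", encoding="utf-8") as fh:
        fh.write("\n".join(terms))


if __name__ == "__main__":
    soup = BeautifulSoup(SAMPLE_HTML, "html.parser")
    print(soup.find_all("strong"))  # empty: this markup has no <strong> tags
    terms = extract_bold_terms(SAMPLE_HTML)
    print(terms)
    save_terms(terms, "terms.txt")  # hypothetical output filename
```

To use this against the real page, pass r.content from the question's code to extract_bold_terms in place of SAMPLE_HTML.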
