Question

如何在html标签中获取所有内容？

from bs4 import BeautifulSoup
content = "<a><b>scgvggvd</b></a>"
soup = BeautifulSoup(content, 'html.parser')
matched_list = soup.find('a')
print(matched_list)

上面的

代码将返回：

<a><b>scgvggvd</b></a>

我想要的是：

<b>scgvggvd</b>

找到标签<a>后将其删除

我希望该解决方案也能与find_all（）一起使用

Answer 1

from bs4 import BeautifulSoup
content = "<a><b>scgvggvd</b></a>"
soup = BeautifulSoup(content, 'html.parser')
matched_list = soup.find('a')
for b in matched_list:
    print(b)

Answer 2

如果<b>标签是<a>标签的兄弟姐妹，请使用以下行：

matched_list = soup.select_one('b')

如果<b>标签是<a>标签的子元素，请使用以下行：

matched_list = soup.select_one('a b')

如果需要多次匹配，请使用select代替select_one。

如何在美丽的Shop 4中获得标签内的内容？

2 个答案: