Question

Good day,

I am trying to write my first web scraper. I have managed to write the following:

import requests
from bs4 import BeautifulSoup

s = requests.Session()

r = s.get("http://www.sharenet.co.za/v3/quickshare.php?scode=BTI")

r = s.post("http://www.sharenet.co.za/v3/quickshare.php?scode=BTI")

soup = BeautifulSoup(r.text, "html.parser")
print(soup.find_all("td", class_="dataCell"))

I am trying to extract the share price. When I inspect the element, this is the HTML:

<td class="dataCell" align="right">85221</td>

[Image: share price table]

Basically, my problem is that I can search for all the tags, but I cannot extract the exact one I want.

Thank you in advance for your help.

Answer 1

Tags have a get_text() method. find_all returns a list of tags, so call get_text() on each one:

for cell_tag in soup.find_all("td"):
    print(cell_tag.get_text())
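
The loop above prints the text of every td on the page. To narrow it down to the cells from the question, one option is to keep the class_="dataCell" filter and then pick the cell you need by position. The following is only a sketch: it parses a small made-up HTML string instead of the live page, the second row and the "label" class are invented for illustration, and taking the first match as the price is an assumption about the page layout.

```python
from bs4 import BeautifulSoup

# Sample markup modeled on the cell shown in the question. The surrounding
# table, the "label" cells and the second row are hypothetical.
html = """
<table>
  <tr><td class="label">Price</td><td class="dataCell" align="right">85221</td></tr>
  <tr><td class="label">Volume</td><td class="dataCell" align="right">1200</td></tr>
</table>
"""

soup = BeautifulSoup(html, "html.parser")

# Restrict the search to cells with class "dataCell", then take each cell's text.
values = [
    cell.get_text(strip=True)
    for cell in soup.find_all("td", class_="dataCell")
]
print(values)

# Assumption: the price is the first matching cell. On the real page the
# position may differ, so check the printed list before relying on an index.
price = int(values[0])
print(price)
```

With the real page you would build soup from r.text as in the question and inspect the printed list to find which index holds the share price.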

Webscraper in Python - how do I extract the exact text I need?

1 answer: