Question

我正在使用bs4抓取一些结果。我可以在源代码中看到HTML内容，但是当我尝试使用bs4来获取它时，它并没有显示“文件不存在”

from bs4 import BeautifulSoup
import requests

source = requests.get("https://result.smitcs.in/grade.php?subid=BA1106")    
soup = BeautifulSoup(source.text, "html.parser")

marks_pre = soup.find("pre")
marks = marks_pre.find("div")

print(marks.prettify())

上面的代码返回

<div style="font-family: courier; line-height: 12px;font-size:
20px;background:white;">  File does not exist </div>

如果我从网上复制源代码并将其本地保存为HTML文件，然后进行获取，则上述代码可以正常工作。

Answer 1

尝试

from bs4 import BeautifulSoup
import requests

URL = "https://result.smitcs.in/grade.php?subid=BA1106"
PAGE = requests.get(URL)

# get HTML content
SOUP = BeautifulSoup(PAGE.content, 'lxml')

marks = SOUP.find("div")

print(marks.prettify())

为什么Beautiful Soup不返回内容？

1 个答案: