Question

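I'm teaching myself a few things about this website. I'm trying to get the raw headers, and then the 3-digit HTTP status code, from the site. This is what I have so far: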

import urllib.request
with urllib.request.urlopen('https://www.youtube.com/results?search_query=clippers+vs+lakers') as response:
    html_text = response.read()
print(html_text)

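It prints everything from the page source. I then used "Command + F" to search for keywords such as "raw header", but I couldn't find anything useful. Can someone help me get the raw headers for this page? Is there a library that can do this? Thanks!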

Answer 1

Try the response.info() method to get the headers.
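
As a rough sketch of that suggestion, assuming Python 3 and the same urllib.request module used in the question: the headers are not part of the HTML body, which is why searching the page source finds nothing. They live on the response object, next to the status code. The helper name fetch_status_and_headers is made up for illustration.

```python
import urllib.error
import urllib.request


def fetch_status_and_headers(url):
    """Return (status_code, header_pairs) for a GET request to url.

    fetch_status_and_headers is a hypothetical helper name, not a urllib API.
    """
    try:
        with urllib.request.urlopen(url) as response:
            # response.status is the 3-digit HTTP status code (e.g. 200).
            # response.info() returns the header object; items() gives
            # (name, value) pairs and keeps repeated headers such as Set-Cookie.
            return response.status, response.info().items()
    except urllib.error.HTTPError as err:
        # urlopen raises HTTPError for 4xx/5xx responses; the exception
        # still carries the status code and the response headers.
        return err.code, err.headers.items()
```

Calling it as `status, headers = fetch_status_and_headers(url)` with the URL from the question, then looping over `headers` and printing each `name: value` pair, should show the code and the headers the server sent. Connection failures (urllib.error.URLError) are not handled in this sketch.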

Answer 2

The simplest and most elegant way is to use the requests library:

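import requests as req

url = 'https://www.youtube.com/results?search_query=clippers+vs+lakers'
response = req.get(url)

status_code = response.status_code  # 3-digit HTTP status code, e.g. 200
headers = response.headers          # response headers (case-insensitive dict)
html = response.text                # decoded response body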

How do I find the raw headers of a website using Python?
