Question

I'm receiving the data from an HTTP GET request in the following form:

GET http://google.com/ HTTP/1.1
Host: google.com
Proxy-Connection: keep-alive
Cache-Control: max-age=0
etc etc

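I want to extract the URL that follows GET and store it in a variable. I've looked through a few things online but couldn't find a reliable way to do it. Does anyone have any suggestions?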

Answer 1

Use a regexp:

import re

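# Capture everything between "GET " and the last space on the request line.
url_pattern = re.compile(r"^GET (.*)[ ].*")
line = "GET http://google.com/ HTTP/1.1"
match = url_pattern.match(line)
url = match.group(1) if match else None  # None if the line is not a GET request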

That is, as long as you're sure there are no spaces in the URL :-D
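
If the whole request arrives as one string rather than a single line, a regexp-free variant might look like the sketch below. It assumes the request line is the first line and consists of exactly three space-separated parts (method, URL, version); the function name `extract_request_target` and the sample `raw` string are made up for illustration.

```python
def extract_request_target(raw_request):
    """Return the URL from the first line of a raw GET request, or None."""
    # The request line is everything before the first line break.
    request_line = raw_request.splitlines()[0] if raw_request else ""
    parts = request_line.split()
    # Expect exactly three parts: method, URL, HTTP version.
    if len(parts) != 3 or parts[0] != "GET":
        return None
    return parts[1]


raw = "GET http://google.com/ HTTP/1.1\r\nHost: google.com\r\nProxy-Connection: keep-alive\r\n"
url = extract_request_target(raw)
print(url)  # http://google.com/
```

Returning None instead of raising keeps the caller in charge of what to do with a malformed or non-GET request.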