由于某些原因,我想将原始http标头发送到服务器,python 请求可以这样做吗?例如,像这样的http标题,
GET http://baidu.com/ HTTP/1.1
Host: baidu.com
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:38.0) Gecko/20100101 Firefox/38.0
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-US,en;q=0.5
Accept-Encoding: gzip, deflate
Connection: keep-alive
我发现 twisted 可能会这样做,但它有点复杂。
答案 0 :(得分:2)
使用twisted
:
from twisted.internet import reactor
from twisted.web.client import Agent
from twisted.web.http_headers import Headers
agent = Agent(reactor)
d = agent.request(
'GET',
'http://baidu.com/',
Headers({
'User-Agent': ['Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:38.0) Gecko/20100101 Firefox/38.0'],
'Accept': ['text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8'],
'Accept-Language': ['en-US,en;q=0.5'],
'Accept-Encoding': ['gzip, deflate'],
'Connection': ['keep-alive']
}),
None)
def Response(null):
print('Response received')
def Shutdown(null):
print('Shutting down the reactor now')
reactor.stop()
d.addCallback(Response) # exec Response() after request is rcvd
d.addBoth(Shutdown) # shut down after response rcvd
reactor.run()
它更复杂(特别是如果你想用响应来“做事”),但如果你计划在Python中进行web或并发编程,你应该知道twisted
。希望这对您有所帮助,如果不是,我希望它可以帮助有人在使用HTTP标头和twisted
。
使用treq
:
from __future__ import print_function
from treq import get
from twisted.internet.task import react
def handleResponse(response):
""" Callback Function
Once the response is recived, display the information.
This is the part where I suspect people will have the most
trouble wrapping their heads around since it's heavily
dependent on deferreds (ie. futures or promises).
"""
print('Code: %s\n' % response.code)
print('Simple print:')
response.content().addCallback(print) # simple way to print on py2 & py3
text = response.text() # returns a deferred
text.addCallback(displayText) # the way you should be handling responses, ie. via callbacks
def displayText(text):
""" Callback Function
Simply display the text. You would usually do more useful
things in this call back, such as maniuplating the response
text or setting the text to some global or otherwise accessible
variable(s).
"""
print('Deferred print:')
print(text)
def main(reactor):
"""
This is the main function which will execute a request using the
GET method. After getting the response, the response code and content
will be displayed. Finally, the twisted reactor will stop (since
the react function is being used).
"""
url = 'http://baidu.com/'
header={
'User-Agent': ['Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:38.0) Gecko/20100101 Firefox/38.0'],
'Accept': ['text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8'],
'Accept-Language': ['en-US,en;q=0.5'],
'Accept-Encoding': ['gzip, deflate'],
'Connection': ['keep-alive']}
d = get(url, headers=header)
d.addCallback(handleResponse)
return d
react(main) # run the main function and display results
treq
包比直接使用twisted
更容易使用,它共享requests
的许多功能和语法。
twisted
的基础知识。答案 1 :(得分:1)
你可以这样做:
import requests
headers = {'Host': 'baidu.com',
'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10.10; rv:38.0) Gecko/20100101 Firefox/38.0,'
'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
'Accept-Language': 'en-US,en;q=0.5',
'Accept-Encoding': 'gzip, deflate',
'Connection': 'keep-alive'}
requests.get('http://baidu.com/', headers=headers)
答案 2 :(得分:1)
requests.request
方法(及其所有衍生产品,如request.get
或request.head
)可以传递headers
参数。请参阅request和custom headers的文档。
您可以像
一样使用它requests.get('http://baidu.com', headers={'Host':'baidu.com',
'Accept-Encoding': 'gzip, deflate',
...})