我遇到了严重的问题,试图让Mechanize保留第一个请求的URL设置的cookie,以便在请求第二个URL时使用。为了测试是否发送了任何内容,我将以下文件放在我的服务器上(cookies.php):
<?php
echo "cookies:::";
print_r($_COOKIE);
?>
以下是python文件,意味着实现我的目标。我希望第一个网站设置的cookie(我输出用于测试)也由我的PHP文件输出(也就是在第二个请求之前设置):
from BeautifulSoup import BeautifulSoup, BeautifulStoneSoup
import mechanize
from pprint import *
#
class NullCookieProcessor(mechanize.HTTPCookieProcessor):
def http_request(self, request): return request
def http_response(self, request, response): return response
opener = mechanize.build_opener(NullCookieProcessor)
request = mechanize.Request('http://www.google.com')
response = mechanize.urlopen(request)
cj = mechanize.CookieJar()
cj.extract_cookies(response, request)
pprint(cj)
request2 = mechanize.Request('http://[domain2].com/cookies.php')
cj.add_cookie_header(request2)
response2 = mechanize.urlopen(request2)
print response2.geturl()
print response2.info() # headers
print response2.read() # body (readline and readlines work too)
以下是输出:
<mechanize._clientcookie.CookieJar[Cookie(version=0, name='NID', value='50=rkj1MMbufL7KRMj00TMF4rI4x7VNYgzWk5P97V05gBAMVOrYuSbb6-hpXVC3y_eD999uECgnBn7YqZ-ZGB1kmWhc_wQWV9nKlPER4_3BWEVSGU632vXEhgYROAz3QrP5', port=None, port_specified=False, domain='.google.com', domain_specified=True, domain_initial_dot=True, path='/', path_specified=True, secure=False, expires=1331337059, discard=False, comment=None, comment_url=None, rest={'HttpOnly': None}, rfc2109=False), Cookie(version=0, name='PREF', value='ID=20342e7c6a6b8f8b:FF=0:TM=1315525859:LM=1315525859:S=RppxtfAGwVsGkZiJ', port=None, port_specified=False, domain='.google.com', domain_specified=True, domain_initial_dot=True, path='/', path_specified=True, secure=False, expires=1378597859, discard=False, comment=None, comment_url=None, rest={}, rfc2109=False)]>
http://[MY DOMAIN].com/cookies.php
Date: Thu, 08 Sep 2011 23:51:01 GMT
Server: Apache mod_fcgid/2.3.6 mod_auth_passthrough/2.1 mod_bwlimited/1.4 FrontPage/5.0.2.2635
X-Powered-By: PHP/5.2.17
Connection: close
Transfer-Encoding: chunked
Content-Type: text/html
cookies:::Array
(
)
任何想法如何让这个工作?应该注意的是,上面的代码是从文档中复制+粘贴的......,我还尝试了文档提供的其他示例代码,可以在这里找到:http://wwwsearch.sourceforge.net/mechanize/doc.html
答案 0 :(得分:3)
request1
和request2
来自不同的域名。可以理解,来自google.com
的Cookie不会发送到'[domain2].com'
。