Question

我正在尝试使用urllib2跟踪网址的重定向。

>>> import urllib2
>>> page=urllib2.urlopen('http://acer.com')
>>> print page.geturl()
http://www.acer.com/worldwide/selection.html
>>>page=urllib2.urlopen('http://www.acer.com/worldwide/selection.html')
>>> print page.geturl()
http://www.acer.com/worldwide/selection.html

但是当我在浏览器中打开http://www.acer.com/worldwide/selection.html时，它会重定向到http://us.acer.com/ac/en/US/content/home#_ga=1.216787925.232352975.1435019296如何使用urllib检测此重定向。

Answer 1

get_url()不适用于所有重定向（例如JavaScript重定向）

你想要达到什么目标？

像Selenium那样PhantomJS作为后端可能更适合这一点。

对于屏幕截图，您可以使用save_screenshot()，这是Selenium Webdriver的一部分

Answer 2

使用selenium开始。我正在使用chromedriver作为浏览器：

from selenium.webdriver import Chrome cr = Chrome() cr.get(url) cr.save_screenshot('IMAGE_NAME.png')

使用urllib2关注重定向

2 个答案: