Question

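I am trying to open a list of URLs, and I set a timeout value so that URLs which fail to open are skipped. However, when I hit the following URL, the call hangs and never times out. The site opens normally in a browser, so what is going wrong?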

url='http://www.gizmodo.it/2008/03/12/lo_scanner_di_impronte_digitali_che_distingue_un_dito_vivo_da_unomorto.html'

opener=browser.open(url,timeout=2)

Answer 1

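The page (referenced by the given URL) responds with a refresh: 185 header. This causes HTTPRefreshProcessor to sleep for 185 seconds, refresh the same page, sleep again, and so on (forever).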
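To make the header's meaning concrete, here is a minimal sketch of how a Refresh value can be split into a delay and an optional target URL. The parse_refresh helper is hypothetical and written only for illustration; it is not mechanize's own parser, and it assumes the common "seconds" or "seconds; url=target" forms.

```python
def parse_refresh(value):
    """Split a Refresh header value into (delay_seconds, url_or_None).

    Hypothetical helper for illustration; not part of mechanize.
    """
    # Everything before the first ";" is the delay in seconds.
    delay_part, _, rest = value.partition(";")
    delay = float(delay_part.strip())

    # An optional "url=..." part names the page to load after the delay.
    url = None
    rest = rest.strip()
    if rest.lower().startswith("url="):
        url = rest[4:].strip().strip("'\"")
    return delay, url or None


print(parse_refresh("185"))                         # delay only
print(parse_refresh("5; url=http://example.com/"))  # delay plus target
```

A value with no url part, such as the 185 seen here, means "reload this same page after the delay", which is why the handler keeps sleeping and reloading.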

You can turn off HTTPRefreshProcessor with the set_handle_refresh method, like this:

browser.set_handle_refresh(False) # <-----
browser.open(url, timeout=2.0)
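Applied to the whole list of URLs from the question, the same fix might look like the sketch below. It assumes Python 3, where timeouts and URL errors derive from OSError; make_browser and open_all are hypothetical helper names, and the only mechanize calls used are the ones shown above plus the mechanize.Browser() constructor.

```python
def make_browser():
    """Build a mechanize Browser with refresh handling switched off."""
    import mechanize  # third-party; imported lazily so open_all works without it

    browser = mechanize.Browser()
    browser.set_handle_refresh(False)  # do not sleep on Refresh headers
    return browser


def open_all(browser, urls, timeout=2.0):
    """Try each URL once and return (opened, failed).

    `browser` is any object with an open(url, timeout=...) method.
    `failed` holds (url, exception) pairs for URLs that were skipped.
    """
    opened, failed = [], []
    for url in urls:
        try:
            browser.open(url, timeout=timeout)
        except OSError as exc:  # assumption: Python 3, covers timeouts and URL errors
            failed.append((url, exc))
        else:
            opened.append(url)
    return opened, failed
```

A call such as opened, failed = open_all(make_browser(), urls) then leaves the unreachable URLs in failed instead of blocking on them.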

Python mechanize does not time out and hangs when opening a URL
