Python从URL读取页面?更好的文档?

时间:2011-10-01 01:56:57

标签: python url urllib

我在使用Python的文档时遇到了很多麻烦。有没有像Mozilla开发者网络那样的东西?

我正在做一个Python拼图网站,我需要能够阅读该页面的内容。我在网站上看到了以下内容:

import urllib2

urlStr = 'http://www.python.org/'
try:
  fileHandle = urllib2.urlopen(urlStr)
  str1 = fileHandle.read()
  fileHandle.close()
  print ('-'*50)
  print ('HTML code of URL =', urlStr)
  print ('-'*50)
except IOError:
  print ('Cannot open URL %s for reading' % urlStr)
  str1 = 'error!'

print (str1)

它一直说没有urllib2模块。

Python文档说

The urllib module has been split into parts and renamed in Python 3.0 to urllib.request, urllib.parse, and urllib.error. The 2to3 tool will automatically adapt imports when converting your sources to 3.0. Also note that the urllib.urlopen() function has been removed in Python 3.0 in favor of urllib2.urlopen().

我也试过导入urllib.request,但是它确定了urllib 2的定义...... WTF正在进行中?

版本3.2.2

2 个答案:

答案 0 :(得分:4)

urllib.request.open() ...

中的建议使用Dive into Python 3
Python 3.2.1 (default, Jul 24 2011, 22:21:06) 
[GCC 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2335.15.00)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import urllib.request
>>> urlStr = 'http://www.python.org/'
>>> fileHandle = urllib.request.urlopen(urlStr)
>>> print(fileHandle.read()[:100])
b'<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtm'

答案 1 :(得分:3)

您可能引用的文档是the Python 2 documentation for urllib2。您应该使用的文档是the Python 3 documentation for urllib.request