运行test.py
时出现此错误C:\Python32>python.exe test.py
Traceback (most recent call last):
File "test.py", line 5, in <module>
import httplib
ImportError: No module named httplib
如何纠正?
test.py 的代码块:
#!/usr/local/bin/python
import httplib
import sys
import re
from HTMLParser import HTMLParser
class miniHTMLParser( HTMLParser ):
viewedQueue = []
instQueue = []
def get_next_link( self ):
if self.instQueue == []:
return ''
else:
return self.instQueue.pop(0)
def gethtmlfile( self, site, page ):
try:
httpconn = httplib.HTTPConnection(site)
httpconn.request("GET", page)
resp = httpconn.getresponse()
resppage = resp.read()
except:
resppage = ""
return resppage
def handle_starttag( self, tag, attrs ):
if tag == 'a':
newstr = str(attrs[0][1])
if re.search('http', newstr) == None:
if re.search('mailto', newstr) == None:
if re.search('htm', newstr) != None:
if (newstr in self.viewedQueue) == False:
print (" adding", newstr)
self.instQueue.append( newstr )
self.viewedQueue.append( newstr )
else:
print (" ignoring", newstr)
else:
print (" ignoring", newstr)
else:
print (" ignoring", newstr)
def main():
if sys.argv[1] == '':
print ("usage is ./minispider.py site link")
sys.exit(2)
mySpider = miniHTMLParser()
link = sys.argv[2]
while link != '':
print ("\nChecking link ", link)
# Get the file from the site and link
retfile = mySpider.gethtmlfile( sys.argv[1], link )
# Feed the file into the HTML parser
mySpider.feed(retfile)
# Search the retfile here
# Get the next link in level traversal order
link = mySpider.get_next_link()
mySpider.close()
print ("\ndone\n")
if __name__ == "__main__":
main()
答案 0 :(得分:88)
您正在Python 3上运行Python 2代码。在Python 3中,该模块已重命名为http.client
。
您可以尝试在代码上运行2to3
tool,并尝试自动翻译。对httplib
的引用将自动重写,以改为使用http.client
。
答案 1 :(得分:0)
当我尝试缩小Docker容器时遇到了这个问题。这是因为我用以下命令安装了Python 2.7:
apt-get install -y --no-install-recommends python
我不应该包含--no-install-recommends
标志:
apt-get install -y python
答案 2 :(得分:0)
答案 3 :(得分:0)
您可以使用以下代码导入 http.client 并将其重命名为 httplib :
将 http.client 导入为 httplib