某些网站的urlib.request.urlopen()无法正常运行

时间:2020-07-17 14:06:31

标签: python-3.x urllib urlopen

我当前正在使用Python 3.6.9,并尝试使用urllib.request库,但事实证明,对于某些url无效,例如,保留的代码段不适用于“ http:// py4e -data.dr-chuck.net/comments_802989.html”和其他几个网址。但是,使用“ https://www.google.com/”之类的几个网址就可以正常工作。

import urllib.request , urllib.parse, urllib.error
from bs4 import BeautifulSoup

url = input("TYPE-:")
html = urllib.request.urlopen(url)
html = html.read()
soup = BeautifulSoup(html,'html.parser')
my = soup('span')
cou = 0
for i in my:
    print(i.contents)
print(cou)

当我提供输入作为下面提到的链接时 TYPE-:http://py4e-data.dr-chuck.net/comments_802989.html

这些是我的终端中弹出的错误消息


Traceback (most recent call last):
  File "/usr/lib/python3.6/urllib/request.py", line 1318, in do_open
    encode_chunked=req.has_header('Transfer-encoding'))
  File "/usr/lib/python3.6/http/client.py", line 1254, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File "/usr/lib/python3.6/http/client.py", line 1300, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.6/http/client.py", line 1249, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.6/http/client.py", line 1036, in _send_output
    self.send(msg)
  File "/usr/lib/python3.6/http/client.py", line 974, in send
    self.connect()
  File "/usr/lib/python3.6/http/client.py", line 946, in connect
    (self.host,self.port), self.timeout, self.source_address)
  File "/usr/lib/python3.6/socket.py", line 704, in create_connection
    for res in getaddrinfo(host, port, 0, SOCK_STREAM):
  File "/usr/lib/python3.6/socket.py", line 745, in getaddrinfo
    for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
socket.gaierror: [Errno -2] Name or service not known

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "1.py", line 6, in <module>
    html = urllib.request.urlopen(url)
  File "/usr/lib/python3.6/urllib/request.py", line 223, in urlopen
    return opener.open(url, data, timeout)
  File "/usr/lib/python3.6/urllib/request.py", line 526, in open
    response = self._open(req, data)
  File "/usr/lib/python3.6/urllib/request.py", line 544, in _open
    '_open', req)
  File "/usr/lib/python3.6/urllib/request.py", line 504, in _call_chain
    result = func(*args)
  File "/usr/lib/python3.6/urllib/request.py", line 1346, in http_open
    return self.do_open(http.client.HTTPConnection, req)
  File "/usr/lib/python3.6/urllib/request.py", line 1320, in do_open
    raise URLError(err)
urllib.error.URLError: <urlopen error [Errno -2] Name or service not known>

我尝试了一些建议,例如降级html5并尝试升级软件包,但似乎都没有效果

预先感谢:)

更新:此代码在VsCode终端中运行正常,但在ubuntu终端中未运行并弹出错误,这意味着什么??

0 个答案:

没有答案