Question

我需要使用Urllib访问网站，然后在该网站上搜索该网页上的所有图像。我相信我已经成功编写了访问该网站的代码，我只需要立即搜索该网站。

我将能够创建正则表达式，但是在图像以HTML格式显示时我需要帮助，因此我知道如何创建正则表达式来搜索该图像。

我发布的代码不包含正则表达式，因为我尚未编写正则表达式，我之所以将其包括在内是因为。只是在寻找一些指导。感谢您的所有帮助！

    import urllib.request
    import ssl

    website = 'https://www.google.com'

    html = urllib.request.urlopen(website)
    for line in html:
        print(line)

Answer 1

from bs4 import BeautifulSoup
soup = BeautifulSoup(html, 'html.parser')
for img in soup.find_all('img'):
  print img

请参见https://www.crummy.com/software/BeautifulSoup/bs4/doc/#quick-start。

通过Urllib.request搜索W /正则表达式

1 个答案: