Question

嗨，我正在尝试编写一个抓取URL的程序，如果抓取数据包含特定字符串，请执行一些操作，我该如何使用精美的汤来实现这一目标

import requests
from bs4 import BeautifulSoup
data = requests.get('https://www.google.com',verify=False)
soup= BeautifulSoup(data.string,'html.parser')
for inp in soup.find_all('input'):
    if inp == "Google Search":
        print ("found")
    else:
        print ("nothing")

Answer 1

您的inp是html对象。您必须使用get_text（）函数

import requests
from bs4 import BeautifulSoup
data = requests.get('https://www.google.com',verify=False)
soup= BeautifulSoup(data.string,'html.parser')
for inp in soup.find_all('input'):
    if inp.get_text() == "Google Search":
        print ("found")
    else:
        print ("nothing")

Answer 2

verify = False，禁用证书验证。这是安全问题。尤其是如果您在公司网络内部，这很危险，因为它为中级攻击提供了可能。您应该使用适当的证书授权。

如何使用beautifulsoup检查字符串是否存在

2 个答案: