硒解析亚马逊(Python)

时间:2020-12-27 09:37:34

标签: python selenium parsing

您好,我正在尝试解析文本中的所有星级评论(4、1、4、2 等)

driver.get('https://www.amazon.com/gp/new-releases/kitchen/ref=zg_bs_tab_t_bsnr')

elements=WebDriverWait(driver,20).until(EC.presence_of_all_elements_located((By.CSS_SELECTOR,".a-icon-alt")))
for i in elements:
    i = i.get_attribute("innerHTML")
    i= i.split(' ')[0]
    list3.append(i)

我想解析评论之星,如果评论之星不存在而不是打印一些东西。

1 个答案:

答案 0 :(得分:1)

试试:

from bs4 import BeautifulSoup as bs
from selenium import webdriver
from webdriver_manager.chrome import ChromeDriverManager
import time

driver = webdriver.Chrome(ChromeDriverManager().install())
driver.set_window_size(1024, 600)
driver.maximize_window()
url = "https://www.amazon.com/gp/new-releases/kitchen/ref=zg_bs_tab_t_bsnr"

driver.get(url)
time.sleep(2)

soup=bs(driver.page_source,'html.parser')

for card in soup.findAll('span', {'class': 'aok-inline-block zg-item'}):
    elem = card.find('span', {'class': 'a-icon-alt'})
    if elem:
        print(elem.text.split()[0])
    else: print("no")