网站上的文字未使用硒进行抓取

时间:2020-08-19 04:17:56

标签: python selenium discord.py

我正在尝试从网站订购的轮胎发送文本到不和谐的嵌入物中。我登录该站点,搜索一个项目,然后看到它可用。我希望嵌入的不和谐显示可订购的数量,但什么也不显示。它不会显示其他元素,但不会显示AvailableAlt PlantTire TreadEst Delivery

(很抱歉,如果我的代码混乱,我是新来的人,并且在过去的一个小时里一直在玩它)

# Finding the Quantity + Tire Description
    try:
        available1 = WebDriverWait(driver, 10).until(
            EC.presence_of_element_located((By.XPATH, "//*[@id='resultsTable']/tbody/tr[2]/td[9]"))
        )
        altPlant1 = WebDriverWait(driver, 10).until(
            EC.presence_of_element_located((By.XPATH, "/html/body/div[2]/div[2]/div[2]/div/form/div[1]/table/tbody/tr[2]/td[10]"))
        )
        estDelivery1 = WebDriverWait(driver, 10).until(
            EC.presence_of_element_located((By.XPATH, "/html/body/div[2]/div[2]/div[2]/div/form/div[1]/table/tbody/tr[2]/td[11]"))
        )
        tireSize1 = WebDriverWait(driver, 10).until(
            EC.presence_of_element_located((By.XPATH, "/html/body/div[2]/div[2]/div[2]/div/form/div[1]/table/tbody/tr[2]/td[5]"))
        )
        tireTread1 = WebDriverWait(driver, 10).until(
            EC.presence_of_element_located((By.XPATH, "//*[@id='resultsTable']/tbody/tr[2]/td[7]"))
        )
        itemCode1 = WebDriverWait(driver, 10).until(
            EC.presence_of_element_located((By.XPATH, "/html/body/div[2]/div[2]/div[2]/div/form/div[1]/table/tbody/tr[2]/td[2]"))
        )
    except:
        driver.quit()
    
    available = available1.text
    altPlant = altPlant1.text
    estDelivery = estDelivery1.text
    tireSize = tireSize1.text
    tireTread = tireTread1.text
    itemCode = itemCode1.text

    print(available)
    print(altPlant)
    print(estDelivery)
    print(tireSize)
    print(tireTread)
    print(itemCode)
    
# Discord Embed Setup   
    embed = Embed(
        description='**Stock available for item number '+pItemNumber+'**',
        color=0x0d0d22,
        timestamp='now'  # sets the timestamp to current time
        )

    embed.set_author(name='Pirelli Stock Check')
    embed.add_field(name='Item Number', value=itemCode, inline=True)
    embed.add_field(name='Tire Description', value=tireTread+' '+tireSize, inline=True)
    embed.add_field(name='Available', value=available+' In Stock', inline=False)
    embed.add_field(name='Alt Plant', value=altPlant+ ' In Stock', inline=True)
    embed.add_field(name='Est Delivery', value=estDelivery+'1', inline=True)
    embed.set_footer(text='Tires Tools', icon_url='https://cdn.discordapp.com/avatars/628005829840470037/8286685de0f2d7d94d94e020caf3265d.png?size=128')

    hook.send(embed=embed)

    print("Embed sent to discord!")

我需要的4个字符串也在源代码中。 tireTread = CINTURATO P7available = 12altPlant = 7estDelivery = 08/24/2020

The 4 rl

这就是我不和谐嵌入的结果。

enter image description here

知道我在做什么错吗?让我知道您是否需要更多信息。

1 个答案:

答案 0 :(得分:1)

您可以按其ID或CSS类查找元素:

table = WebDriverWait(driver,10).until(EC.presence_of_element_located((By.ID, "tabla_evolucion")))


table = WebDriverWait(driver,10).until(EC.presence_of_element_located((By.class, "css-class sub-class")))

我建议您按ID查找表,然后通过循环读取行。也许您需要根据需要更改以下代码。

table_id = self.driver.find_element(By.ID, 'table_id')
rows = table_id.find_elements(By.TAG_NAME, "tr") # get all of the rows in the table
for row in rows:
    # Get the columns (all the column 2)        
    col = row.find_elements(By.TAG_NAME, "td")[1] #note: index start from 0, 1 is col 2
    print col.text #prints text from the element