Question

我正在尝试使用Python，BeautifulSoup和Selenium从Quora中提取数据以进行分析。但是我无法提取页面上的JavaScript元素。我应该如何提取它们？

在这里，我只想提取Quora配置文件的简介，但我没有收到单击“更多”按钮后出现的文本。

                # Extracting Bio
                driver.find_element_by_class_name('ui_qtext_more_link').send_keys(Keys.ENTER)
                bio = driver.find_element_by_class_name("ui_qtext_rendered_qtext").text

Answer 1

请使用下面的代码行首先单击“更多”按钮，然后获取个人资料的展开文本。

import time
//Fetch the more button element first
WebElement moreButton = driver.find_element_by_xpath("(//a[@class='ui_qtext_more_link'])[1]");    
//Click on the more button
moreButton.click();
time.sleep(3)
//Fetch the profileInfo element
WebElement profileInfo = driver.find_element_by_xpath("(//div[contains(@id,'expanded_content')]//span[@class='ui_qtext_rendered_qtext'])[1]");
//Store the bio in a string and use it further
String profileInfoBio = profileInfo.text;

刮取Quora时无法提取JavaScript元素

1 个答案: