Question

我能够使用美丽的汤

获取带有< list_selected >元素的TD元素中的文本

soup.find_all(class_ = {"list_selected"})

之后我必须获取NAME部分。有许多类似的块。

<tr>
        <td align="left" style="padding-left: 3px;padding-right:3px;" class="list_selected">1422</td>
        <td align="left" style="padding-left: 3px;padding-right:3px;" class="data">123456</td>
        <td align="left" style="padding-left: 3px;padding-right:3px;" class="data">NAME</td>
</tr>

Answer 1

soup.find_all("td", { "class" : "list_selected" })

这将为您获取td个节点。结果是根据documentation的节点列表。

Answer 2

漂亮的肥皂有一个名为（.text）的方法来获取html文件的内部内容

我更正了下面的代码以获取内部文字

 from bs4 import BeautifulSoup

 soup1 = BeautifulSoup('<td align="left" style="padding-left:3px;padding-right:3px;" class="list_selected">1422</td>',"lxml")

 second= soup1.find("td", {"class": "list_selected"}) #Finding td class

 name = second.text #Getting inner text contents of td class

 print name #Displays inner text

希望你把一切都弄好：）

使用Beautiful Soup获取元素

2 个答案: