Question

<td class="tag">
    <a href="/tag/android"  rel="tag">
         <img src="http://127.0.0.1/idf2.png" >
    android
    </a>          
</td>

代码：

soup = BeautifulSoup(html)
print html.td.a.string   # output None

BeautifulSoup4中的哪个方法可以检索<a>的内容android

Answer 1

它是.text，而不是.string：

>>>> soup.td.a.text.strip()
u'android'

请注意，我strip踩它，因为否则它也会包含换行符。

此外，您可能应该通过其他方式查找从中提取文本所需的确切a标记，因为页面上可能有许多a标记，这样您就可以了只得到第一个。但这当然取决于您用来找到正确标签的标准。