Question

嘿伙计们我正在学习Python（在PyCharm中）并且我一直在学习一些教程

并拥有此工作代码：

        import requests
        from bs4 import BeautifulSoup
        import operator


        def start(url):
            word_list = []
            source_code = requests.get(url).text
            soup = BeautifulSoup(source_code)
            for post_text in soup.findAll('a', {'class': 'index_singleListingTitles'}):
                content = post_text.string
                words = content.lower().split() #make all lower case and splits it up (based on spaces)
                for each_word in words:
                    print(each_word)
                    word_list.append(each_word)

        start('https://www.thenewboston.com/tops.php?type=text&period=this-month')

但我遇到的代码是：for post_text in soup.findAll('a', {'class': 'index_singleListingTitles'}):它使用BeautifulSoup对标签，类等的网页的html源进行排序..并获取信息

我无法将其设置为查找以下Strong标记：

<a href="/anime/5114/Fullmetal_Alchemist:_Brotherhood" class="hoverinfo_trigger" id="#area5114" rel="#info5114"><strong>Fullmetal Alchemist: Brotherhood</strong></a>

我知道大多数网页都反对废弃它们（我只是尝试其他网站和一个页面，我不做任何程序或任何有害的东西）只是学习。

我已经尝试了很多东西，我开始认为它不会在网页上工作我尝试过但是我怀疑BeautifulSoup是否有可能阻止..

任何帮助都会很棒，只学习python 2天所以对我来说很容易！感谢。

BeautifulSoup只找到html标签（没有类）

0 个答案: