需要在一个类之间包含文本

时间:2019-04-09 02:23:36

标签: python-3.x beautifulsoup

我需要从匹配的每一个中捕获文本,直到匹配结束。.该标题将在html中更改很多次。我可以捕获所有数据,但是有更好的方法吗?

<div class="box">
    <a class="visual" href="https://www.example.com">
        <img src="https://www.example.com/img.jpg" alt="image description">
        <h2>Ventura</h2>
    </a>
    <div class="status-row">
        <div class="service">
            <span class="icon nowork"></span> No work                                   
        </div>
        <div class="work">
            <div class="number">0</div> Planned Work
        </div>
    </div>
</div>

    <div class="box">
    <a class="visual" href="https://www.example.com">
        <img src="https://www.example.com/img.jpg" alt="image description">
        <h2>Boston</h2>
    </a>
    <div class="status-row">
        <div class="service">
            <span class="icon disruption"></span> Disruptions                                   
        </div>
        <div class="no-work">
            <div class="number">0</div> No Work
        </div>
    </div>
</div>

    page = requests.get(url,verify=False)
soup = BeautifulSoup(page.text, 'html.parser')

s = 'Ventura'

for x in soup.findAll("div",  {"class": ["box", "status-row"]}):
    z = x.get_text()
    if  s in z.strip():
        print(z)

有更好的方法吗?

0 个答案:

没有答案