goquery:在到达另一个元素时停止解析

时间:2017-01-10 21:29:41

标签: go goquery

假设我有这个HTML页面。我想使用Gogoquery解析它:

<html>
    <head><!--Page header stuff--></head>
    <body>
         <h1 class="h1-class">Heading 1</h1>
             <div class="div-class">Stuff1</div>
             <div class="div-class">Stuff2</div>
         <h1 class="h1-class">Heading 2</h1>
             <div class="div-class">Stuff3</div>
             <div class="div-class">Stuff4</div>
    </body>
</html>

碰巧,我只希望在标题2之前获得那些DIV并跳过其余部分。此代码非常适合所有 DIV:

 doc := GetGoQueryDocument(url) //Defined elsewhere
 doc.Find("div.div-class").Each(func(_ int, theDiv *goquery.Selection){
     //do stuff with each theDiv
     //The problem is that it finds div.div-class elements below Heading 2.
     //I want to skip those.
 })

有没有办法告诉goquery跳过某个标签和类名下面的元素?感谢您的任何提示!

1 个答案:

答案 0 :(得分:2)

是的,实际上非常简单:

doc.Find(".h1-class").First().NextUntil(".h1-class")

我建议你通读godoc:https://godoc.org/github.com/PuerkitoBio/goquery

它解释了您可以操纵选择的所有不同方法。