Question

我有一个具有以下结构的页面：

<h2 id="about">About</h2>
<p style="margin-top: 0px;" >a bunch of text</p>
<p class="contactAdvisor"><a href="http://www.domain.com"></a></p>

我试图选择第2段中的所有文字（例如一堆文字）

我试过了：

//p[preceding-sibling::h2[id='about'] and following sibling::p[class='contactAdvisor']]

但是，我得不到任何回报。

示例网址：http://www.aplaceformom.com/community/1777-haslett-road-apt-142119

不确定为什么这不起作用。我尝试在Google表格中使用Importxml函数以及Screaming Frog作为自定义提取使用xpath

Answer 1

您也可以使用以下xpath中的任何一个： -

//p[not(child::*)]

//p[preceding::h2[@id = 'about'] and not(child::*)]

//p[preceding::h2[@id = 'about'] and following::p[@class = 'contactAdvisor']]

//p[preceding::h2[@id = 'about']][1]

//p[@class = 'contactAdvisor']/preceding::p

//h2[@id = 'about']/following-sibling::p[1]

//h2[@id = 'about']/following::p[1]

//h2[@id = 'about']/following::p[not(child::*)]

希望它有所帮助.. :)

使用Xpath在页面上的两个不同元素之间选择HTML文本

1 个答案: