Question

我试图找出如何使用RapidMiner上的xPath从下面的h1下面选择文本的ABCD部分。

 <h1> 2010 ABCD EFGTTH FEEDS ASSGEAA </h1>

我已经提供以下内容，但它根本不会返回任何结果。

substring-before(substring-after(substring(//h1, 1, 4)/text(), ' '), ' ')

有人可以给我一些建议吗？谢谢。

Answer 1

这是一个可用于此特定 HTML输入的XPath：

substring-before(substring-after(normalize-space(//h1), ' '), ' ')

基本上，我们的想法是首先删除前导空格，然后获取第一个空格和下一个空格之间的子字符串。

xpathtester演示链接： http://www.xpathtester.com/xpath/24ce7e27ad7030f95032b5b290c57fe0

输出 ABCD