使用html-agility-pack查找包含数据的所有元素 - 属性

时间:2016-02-05 14:54:13

标签: c# html-agility-pack

我正在使用Html-agility-Pack来解析Html文本块。是否可以按属性/属性值查找所有元素的列表?

例如,下面是一个示例html文本。使用Html-agility-pack如何找到具有“data-glossaryid”属性的所有元素?

<p> sample text <a href="" data-glossaryid="F776EB48BD"></a>
<p><img  alt="my pic" src="/~/media/Images/mypic.jpg" /></p>
sample text 
<a  href="" data-glossaryid="5D476EB49E"></a>
<p>  more sample text  </p>
<span data-glossaryid="F776EB49EF"> </span>

1 个答案:

答案 0 :(得分:4)

// the html block of text to parse
var a = @"<p> sample text <a href="""" data-glossaryid=""F776EB48BD""></a>
<p><img  alt=""my pic"" src=""/~/media/Images/mypic.jpg"" /></p>
sample text <a  href="""" data-glossaryid=""5D476EB49E""></a>
<p>  more sample text  </p>
<span data-glossaryid=""F776EB49EF""> </span>";

// create an HtmlDocument
var htmlDocument = new HtmlDocument();
htmlDocument.LoadHtml(a);

// get all elements with the attr data-glossaryid and prints its values
foreach (var item in htmlDocument.DocumentNode.SelectNodes("//*[@data-glossaryid]"))
    Console.WriteLine(item.GetAttributeValue("data-glossaryid", ""));