Question

我有一个问题，我如何使用HTMLAgilityPack来提取某些特定标签的HTML内容。我提供以下HTML代码：

<td class="text_11" width="80%" valign="top">
     <span class="text_11">Producer:</span>
     <a class="link_11b" href="/Asus_producer.htm">Asus</a>
     <br>
     <p>
        <table class="text_11" border="0" style="width: 100%;">
          <tbody>
        </table>
     <p> </p>
</td>

我真正需要做的是提取<td class="text_11" width="80%" valign="top">所包含的HTML代码（如此处所示），即使<td>内有越来越多的代码。

我需要的是将此文本（即HTML）保存到字符串中： `

 <span class="text_11">Producer:</span>
         <a class="link_11b" href="/Asus_producer.htm">Asus</a>
         <br>
         <p>
            <table class="text_11" border="0" style="width: 100%;">
              <tbody>
            </table>
         <p> </p>

` 感谢。

Answer 1

var xpath = "//td[@class='text_11' and @width='80%' and @valign='top']";
var td = doc.DocumentNode.SelectSingleNode(xpath);
string html = td.InnerHtml; // check also if td is not null

使用HtmlAgilityPack保存某些特定标签的HTML内容

1 个答案: