意外的正则表达式导致html

时间:2013-12-16 06:08:29

标签: html regex

尝试在下面使用正则表达式提取但返回四个值:

<a id="ucManageTaxFilesGridControl_gridAccountHistory_lnkHistoryTaxFileName_3" href="javascript:__doPostBack(&#39;ucManageTaxFilesGridControl$gridAccountHistory$ctl05$lnkHistoryTaxFileName&#39;,&#39;&#39;)" style="font-family:Tahoma;font-size:0.8em;font-weight:bold;">TaxFile1</a>

              </td><td>                                                
                <span id="ucManageTaxFilesGridControl_gridAccountHistory_lblHistoryStatus_3" style="color:Black;font-family:Tahoma;font-size:0.8em;">Opened</span>

这是我使用的正则表达式:

<a(?:[^>]id\=\"ucManageTaxFilesGridControl_gridAccountHistory_lnkHistoryTaxFileName.*?\".*?doPostBack\(&#39;(.*?)\&\#39[^>]*)?>(.*?)<\/a>(.*?(\n))+.*<span.*?id=\"ucManageTaxFilesGridControl_gridAccountHistory_lblHistoryStatus_.*\">Opened<\/span>

获得以下结果:

  1. ucManageTaxFilesGridControl $ $ gridAccountHistory $ ctl02 lnkHistoryTaxFileName
  2. TaxFile1
  3. ¬
  4. ¬
  5. 请问我怎样才能得到这个预期结果?

    1. ucManageTaxFilesGridControl $ $ gridAccountHistory $ ctl02 lnkHistoryTaxFileName
    2. TaxFile1
    3. 提前致谢。

1 个答案:

答案 0 :(得分:0)

无法在下面提供和引用的字符串中匹配ucManageTaxFilesGridControl$gridAccountHistory$ctl02$lnkHistoryTaxFileNameTaxFile1

    </td><td>                                                
        <span id="ucManageTaxFilesGridControl_gridAccountHistory_lblHistoryStatus_3" style="color:Black;font-family:Tahoma;font-size:0.8em;">Opened</span>