使用PHP DOM以粗体格式更改标题和其他字符串修改

时间:2016-07-21 15:13:12

标签: php regex dom preg-replace preg-match

<?php
$data = 'THE CORRECT ANSWER IS C.
<p>Choice A Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industrys standard dummy text ever since the 1500s</p>
<p></p>
<p>Choice B Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industrys standard dummy text ever since the 1500s</p>
<p>Choice D Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industrys standard dummy text ever since the 1500s</p>
<p></p>
<p>Choice E simply dummy text of the printing and typesetting industry.</p>
<p></p>
<p><br>THIS IS MY MAIN TITLE IN CAPS<br>This my sub title.</p>
<p><br>TEST ABC: Lorem Ipsum is simply dummy text of the printing and typesetting industry.</p>
<p>1) It is a long established fact <140/90 mmHg OR <130/80 mmHg making it look like readable English will uncover many web sites still in their infancy. 
<br><br>2) There are many variations of passages of Lorem Ipsum available. </p>
<p><br>TEST XYZ: Lorem Ipsum has been the industrys standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book.</p>
<p><br>TES T TEST: It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged.</p>
<p><br>TESTXXX: It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum.</p>';

$dom = new DOMDocument();
@$dom->loadHTML($data, LIBXML_HTML_NODEFDTD);
$xpath = new DOMXPath($dom);
foreach ($xpath->query('//text()') as $node) {
    $txt = trim($node->nodeValue);
    $p   = $node->parentNode;
    if (preg_match("/^\s*(TEST ABC:|TEST XYZ:|TES T TEST:|TESTXXX)(.*)$/s", $node->nodeValue, $matches)) {
        // Put Choice X in bold:
        $p->insertBefore($dom->createElement('strong', $matches[1]), $node);
        $node->nodeValue = " " . trim($matches[2]);
    } else if (strtoupper($txt) === $txt && $txt !== '') {
        // Put header in bold
        $p->insertBefore($dom->createElement('strong', $txt), $node);
        $node->nodeValue = "";
    }
}
$data = $dom->saveHTML();
echo $data;

我已经尝试过第一,第二点工作正常只需要解决第三个问题:

  1. 标题用粗体:“这是我的主要标题”(标题并不总是相同)
  2. 粗体字:TEST ABC:,TEST XYZ:,TES T TEST:,TESTXXX :(这个词总是一样的)
  3. 当您运行此代码时,某些字符串没有显示跳过一行(在字符串外汇中减少和评价:&lt; 140/90 mmHg OR&lt; 130/80 mmHg)。

2 个答案:

答案 0 :(得分:1)

正则表达式确实可用于处理此问题,但通常建议通过DOM执行HTML操作。 PHP的DOMDocument提供了这个。

然后,您可以使用此代码,该代码遍历所有文本节点并查看是否满足以下两个条件中的任何一个:

  • 文本以预定义列表中的文字开头
  • 文字完全是大写的

在这两种情况下,都会使用该内容创建一个新的strong节点,并相应地调整原始节点。

$dom = new DOMDocument();
$dom->loadHTML($data, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$xpath = new DOMXPath($dom);
foreach($xpath->query('//text()') as $node) {
    $txt = trim($node->nodeValue);
    $p = $node->parentNode;
    if (preg_match("/^\s*(TEST ABC:|TEST XYZ:|TES T TEST:|TESTXXX)(.*)$/s", $node->nodeValue, $matches)) {
        // Put Choice X in bold:
        $p->insertBefore($dom->createElement('strong', $matches[1]), $node);
        $node->nodeValue = " " . trim($matches[2]);
    } else if (strtoupper($txt) === $txt && $txt !== '') {
        // Put header in bold
        $p->insertBefore($dom->createElement('strong', $txt), $node);
        $node->nodeValue = "";
    }
}
$data = $dom->saveHTML();

ideone.com

上查看它

答案 1 :(得分:0)

echo preg_replace('/(THIS IS MY MAIN TITLE IN CAPS)/', '<strong>$1</strong>', $data);

Preg_replace语法:preg_replace(正则表达式,替换,主题)

$ 1将显示正则表达式中括号内的标题