Question

我需要替换一些单词或一些句子并将其转换为下划线。我正在使用PHP并找到一个引用：PHP Regex, extract all custom tags from text

不知何故，案件只涵盖单个词而不是句子。如何使正则表达式也可以捕获##标记所包含的所有内容？

假设我的输入如下：

“互联网上的Lorem Ipsum ## generators ## 倾向于根据需要重复预定义的块，使其成为Internet上第一个真正的生成器。它使用了一个字典 ## 200拉丁词## ，结合少数模型句子结构，生成看起来合理的Lorem Ipsum。因此生成的Lorem Ipsum总是 ##没有重复## < / strong>，注入幽默，或非特征词等。“

然后输出将是：

“互联网上的Lorem Ipsum ____1____倾向于根据需要重复预定义的块，使其成为互联网上第一个真正的生成器。它使用了超过____2的字典，结合了少数模型句子结构，生成看起来合理的Lorem Ipsum。生成的Lorem Ipsum因此总是____3____，注入幽默或非特征词等。“

有人可以帮我解决如何获得正则表达式模式吗？

Answer 1

我认为正则表达式是：

/##[^#]+##/g

[Regex Demo]

$text = 'll the Lorem Ipsum ##generators## on the Internet tend to repeat predefined chunks as necessary, making this the first true generator on the Internet. It uses a dictionary of over ##200 Latin words##, combined with a handful of model sentence structures, to generate Lorem Ipsum which looks reasonable. The generated Lorem Ipsum is therefore always ##free from repetition##, injected humour, or non-characteristic words etc.';

preg_match_all('/##[^#]+##/', $text, $matches, PREG_SET_ORDER);

for ($i = 0; $i < count($matches); $i++) {
  $text = preg_replace("/".$matches[$i][0]."/", "___".strval($i+1)."___" , $text, 1);
}

[PHP Demo]

Answer 2

使用@ shA.t的正则表达式的另一个想法是使用preg_replace_callback和一个递增变量的函数。所以它可以在没有循环的情况下完成，这可能会提高效率。

$str = preg_replace_callback('/##[^#]+##/', function($m) use (&$i) {
  return "____". ++$i ."____";
}, $str);

See php demo at eval.in

正则表达式查找自定义符号

2 个答案: