Question

使用php是否可以从img src中删除http：协议？

所以img src将是：

<img src="//www.example.com/image.jpg" />

而不是

<img src="http://www.example.com/image.jpg" />

str_replace会是一个不错的选择吗？我知道我可以定义：

$contentImg = str_replace(array('http', 'https'), '', $filter);

我只是不确定如何定义$ filter。

Answer 1

是的str_replace is where it's at。这将是一个协议相关的链接。

<?php echo str_replace(array('http:', 'https:'), '', 'http://www.google.com'); ?>

输出

//www.google.com

这是预期的。否则，您可以使用preg_replace，这将允许您使用正则表达式或正则表达式。 CommuSoft 以一个很好的例子发布了答案。

Answer 2

假设$filter工作正常并且源是正确获取的，您还可以使用正则表达式替换：

$contentImg = preg_replace('/^https?:/','', $string);

'/^https?:/'这里是正则表达式： - ^字符表示字符串的开头，这样您只能删除前面的潜在协议。 - ?是一个特殊字符，指定s是可选的。因此，它会匹配http:和https:。

使用正则表达式，您可以更紧凑地编写一些查询。说（为了回答）您还希望删除ftp和sftp，您可以使用：

'/^(https?|s?ftp):/'

由于|表示或，括号用于分组。

您也忘了删除冒号（:）。

我更担心您的$filter将包含整个页面源代码。在这种情况下，由于包含http:的文本也可能被删除，因此弊大于利。为了解析和处理XML / HTML，最好使用DOMParser。这将带来一些开销，但正如一些软件工程师所说：＆＃34;软件工程是针对傻瓜的工程系统，宇宙目前产生越来越多的傻瓜，因此有一小部分额外开销是合理的＆＃34;。

示例：

你应该使用之前提到过的DOMParser（因为这种方法更加安全）：

$dom = new DOMDocument; $dom->loadHTML($html); //$html is the input of the document foreach ($dom->getElementsByTagName('img') as $image) { $image->setAttribute('src',preg_replace('/^https?:/','',$image->getAttribute('src'))); } $html = $dom->saveHTML(); //html no stores the new version

（在php -a中运行此功能可为您提供测试示例的预期输出。）

或者在后处理步骤中：

$html = get_the_content(); $dom = new DOMDocument; $dom->loadHTML($html); //$html is the input of the document foreach ($dom->getElementsByTagName('img') as $image) { $image->setAttribute('src',preg_replace('/^https?:/','',$image->getAttribute('src'))); } $html = $dom->saveHTML(); echo $html;

<强>性能：

使用php -a交互式shell（1'000'000实例）对性能进行了测试：

$ php -a php > $timea=microtime(true); for($i = 0; $i < 10000000; $i++) { str_replace(array('http:', 'https:'), '', 'http://www.google.com'); }; echo (microtime(true)-$timea); echo "\n"; 5.4192590713501 php > $timea=microtime(true); for($i = 0; $i < 10000000; $i++) { preg_replace('/^https?:/','', 'http://www.google.com'); }; echo (microtime(true)-$timea); echo "\n"; 5.986407995224 php > $timea=microtime(true); for($i = 0; $i < 10000000; $i++) { preg_replace('/https?:/','', 'http://www.google.com'); }; echo (microtime(true)-$timea); echo "\n"; 5.8694758415222 php > $timea=microtime(true); for($i = 0; $i < 10000000; $i++) { preg_replace('/(https?|s?ftp):/','', 'http://www.google.com'); }; echo (microtime(true)-$timea); echo "\n"; 6.0902049541473 php > $timea=microtime(true); for($i = 0; $i < 10000000; $i++) { str_replace(array('http:', 'https:','sftp:','ftp:'), '', 'http://www.google.com'); }; echo (microtime(true)-$timea); echo "\n"; 7.2881300449371

因此：

str_replace: 5.4193 s 0.0000054193 s/call preg_replace (with ^): 5.9864 s 0.0000059864 s/call preg_replace (no ^): 5.8695 s 0.0000058695 s/call

有关更多可能的部分（包括sftp和ftp）：

str_replace: 7.2881 s 0.0000072881 s/call preg_replace (no ^): 6.0902 s 0.0000060902 s/call

从img src中删除http：

2 个答案: