Question

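I have a log file that contains HTML code. I need to delete everything between each matching pair of html tags in this file. How can I do that with a filter?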

Example of my file:

some text here
<html>
code
</html>
some text there
<html>
code
</html>
some other text

The output should be:

some text here
some text there
some other text

Answer 1

This awk should do:

awk '/<html>/{f=1;next} !f; /<\/html>/{f=0}' file
some text here
some text there
some other text
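
To make the flag logic easier to follow, here is a minimal sketch of the same command spread over several lines with comments. It recreates the sample input from the question first; the filename sample.log is only an assumption for illustration.

```shell
# Recreate the sample input from the question (sample.log is a hypothetical name).
cat > sample.log <<'EOF'
some text here
<html>
code
</html>
some text there
<html>
code
</html>
some other text
EOF

# f is 1 while we are inside an <html>...</html> block and 0 otherwise.
result=$(awk '
  /<html>/   { f = 1; next }   # opening tag: start skipping and drop this line
  !f                           # outside a block: print the line
  /<\/html>/ { f = 0 }         # closing tag: stop skipping (this line was not printed)
' sample.log)

printf '%s\n' "$result"
```

The order of the rules matters: the closing tag resets f only after the print rule has been evaluated, so the </html> line itself is never printed. If an opening tag is never closed, everything after it is skipped.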

Answer 2

Why not:

sed '/<html>/,/<\/html>/d' file

It works for your example.
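
A quick way to check this against the sample from the question is sketched below. The filenames sample.log and cleaned.log are assumptions chosen for the example, not names from the question.

```shell
# Recreate the sample input from the question (sample.log is a hypothetical name).
cat > sample.log <<'EOF'
some text here
<html>
code
</html>
some text there
<html>
code
</html>
some other text
EOF

# Delete every line from one matching <html> through the next matching </html>,
# inclusive, and write the result to a new file (cleaned.log is hypothetical).
sed '/<html>/,/<\/html>/d' sample.log > cleaned.log

result=$(cat cleaned.log)
printf '%s\n' "$result"
```

Note that the range address works on whole lines: if a tag shares a line with other text, that whole line is deleted, and an <html> with no closing tag deletes through the end of the file.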