Question

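I want to write a shell script that reads a file from standard input, removes all <html> strings and empty lines, and writes the result to standard output. The file looks like this: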

#some lines that do not contain <html> in here
<html>a<html>
<tr><html>b</html></tr>
#some lines that do not contain <html> in here
<html>c</html>

So the output file should contain:

#some lines that do not contain <html> in here
a
<tr>b</html></tr>
#some lines that do not contain <html> in here
c</html>

I tried writing this shell script:

read INPUT #read file from std input
tr -d '[:blank:]'
grep "<html>" | sed -r 's/<html>//g'
echo $INPUT

But this script does not work at all. Any ideas? Thanks.

Answer 1

Awk can do this easily:

awk '/./ {gsub("<html>","");print}' INPUTFILE

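It acts only on lines that contain at least one character (so empty lines are dropped), globally replaces "<html>" with the empty string on each such line, and then prints the line.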
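
Since the question asks for a filter from standard input to standard output, the same awk program can be run without a file argument. This is only a minimal sketch; the function name strip_html is a hypothetical helper chosen for illustration and does not appear in the original answer.

```shell
#!/bin/sh
# strip_html: hypothetical helper name (an assumption, not from the answer).
# Reads standard input, skips empty lines, removes every "<html>",
# and writes the remaining text to standard output.
strip_html() {
    awk '/./ { gsub("<html>", ""); print }'
}

# Example run on a few lines shaped like the sample input.
printf '%s\n' '<html>a<html>' '' '<tr><html>b</html></tr>' '<html>c</html>' | strip_html
```

With no file name given, awk reads standard input, so a script containing only the awk line could be called as ./script.sh < input > output.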

Answer 2

Pure bash:

#!/bin/bash

# Read the file named by the first argument, line by line.
while read -r line
do
    # Comment lines are kept, as the expected output requires.
    # Skip empty lines.
    [[ -z $line ]] && continue
    # Remove every <html> and print the rest of the line.
    printf '%s\n' "${line//<html>/}"
done < "$1"

Output:

$ ./replace.sh input
#some lines that do not contain  in here
a
<tr>b</html></tr>
#some lines that do not contain  in here
c</html>

Pure sed:

sed -e 's/<html>//g' -e '/^$/d' input
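
For a script that filters standard input, as the question asks, sed can be wrapped the same way. The sketch below rests on two assumptions that go beyond the answer: lines containing only spaces or tabs count as empty (the question's own attempt used tr -d '[:blank:]'), and a line left empty after removing <html> is dropped too. The name strip_html_sed is hypothetical.

```shell
#!/bin/sh
# strip_html_sed: hypothetical helper name (an assumption, not from the answer).
# First expression: remove every "<html>" on the line.
# Second expression: delete lines that are empty or hold only blanks
# (assumption: whitespace-only lines should be treated as empty).
strip_html_sed() {
    sed -e 's/<html>//g' -e '/^[[:blank:]]*$/d'
}

# Example run: the blank-only line and the bare "<html>" line are dropped.
printf '%s\n' '<html>a<html>' '   ' '<html>' '<tr><html>b</html></tr>' | strip_html_sed
```

Both expressions run in a single sed process, so no second sed is needed in the pipeline.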
