Question

假设我有一个shell脚本foo.sh，它会生成重复的数据集，如下所示：

<name 1>
<address 1>
<mother's name 1>
<pet's name 1>
<comment 1>
<name 2>
<address 2>
<mother's name 2>
<pet's name 2>
<comment 2>
...

我想从每个5行块中提取第1行和第3行，因此修改后的输出如下所示：

<name 1>
<mother's name 1>
<name 2>
<mother's name 2>
...

这些线路没有独特的模式，所以我需要专门按行号过滤。怎么办呢？

Answer 1

使用GNU sed：

sed -n '1~5p;3~5p' file.txt

来自GNU sed手册：

first~step
    Match every step'th line starting with line first. For example, ``sed -n 1~2p''
    will print all the odd-numbered lines in the input stream, and the address 2~5
    will match every fifth line, starting with the second. first can be zero; in
    this case, sed operates as if it were equal to step.  (This is an extension.)

Answer 2

我不是sed专家，但您可以在Awk中执行此操作：

$ awk '(i==0 || i==2) { print }
                      { i = (i+1) % 5 }
  ' < filename.txt

或者，在一行

$ awk '(NR%5==1 || NR%5==3)' < filename.txt

Answer 3

您可以使用awk吗？

awk 'NR % 5 == 1 || NR % 5 == 3' foofile

所以基本上发生的事情是

awk打开foofile。
它逐行读取
行号除以5，如果余数为1或3（即5个块中的第一行或第3行），则打印该行

Answer 4

这可能对您有用：

sed -n 'h;n;n;H;n;n;g;p' file

如何使用sed过滤线条图案？

4 个答案: