使用awk标题

时间:2017-11-17 20:08:55

标签: bash csv awk

所以我正在寻找一些文本文件标题的帮助。

示例数据文件:

$ cat filename.txt
Consoling,ST,DWC,0900
Scribing,RA,DWC,1000
Gloater,AU,DWC,1100
Crimp,DI,DWC,1200
Moving,TI,DWC,1300
Handbook,EN,EBS,0900
Stifling,BA,EBS,1000
Unclothed,CR,EBS,1100
Until,IC,EBS,1200

这是我的awk代码。

sort -t, -k3 -k4 filename.txt |
column -ts, |
awk ' { printf "%-8s %-2s %-3s %-3s\n" , "Family Name", "Initals", "Interviewer Initals", "Interview Time" }
{ printf "%-8s %-2s %-3s %3d\n", $1,$2,$3,$4 }
'

正在输出这样的

Family Name Initals Interviewer Initals Interview Time
Consoling ST DWC 900
Family Name Initals Interviewer Initals Interview Time
Scribing RA DWC 1000
Family Name Initals Interviewer Initals Interview Time
Gloater  AU DWC 1100
Family Name Initals Interviewer Initals Interview Time
Crimp    DI DWC 1200
Family Name Initals Interviewer Initals Interview Time
Moving   TI DWC 1300
Family Name Initals Interviewer Initals Interview Time
Handbook EN EBS 900
Family Name Initals Interviewer Initals Interview Time
Stifling BA EBS 1000
Family Name Initals Interviewer Initals Interview Time
Unclothed CR EBS 1100
Family Name Initals Interviewer Initals Interview Time
Until    IC EBS 1200

但我希望它看起来像是这样。

Family Name Initals Interviewer Initals Interview Time

Consoling   ST      DWC                 0900
Scribing    RA      DWC                 1000
Gloater     AU      DWC                 1100
Crimp       DI      DWC                 1200
Moving      TI      DWC                 1300

Handbook    EN      EBS                 0900
Stifling    BA      EBS                 1000
Unclothed   CR      EBS                 1100
Until       IC      EBS                 1200

有谁知道我必须改变什么?感谢

1 个答案:

答案 0 :(得分:2)

要打印标题一次,请将标题特定的printf命令放在BEGIN块中,例如:

awk '
BEGIN { printf ... header info... }
{ printf ... each data line ... }
'

如果您发现自己想在处理文件后打印一些内容,请使用END块。

至于输出的其余部分,我假设您每次看到一个新的/不同的面试官时都想要打印一个新行。

{ if ( last_interviewer != $3 ) { printf "\n" ; last_interviewr=$3 }
  printf ... each data line 
}

所以,把它们拉在一起......

数据文件:

$ cat filename.txt
Consoling,ST,DWC,0900
Scribing,RA,DWC,1000
Gloater,AU,DWC,1100
Crimp,DI,DWC,1200
Moving,TI,DWC,1300
Handbook,EN,EBS,0900
Stifling,BA,EBS,1000
Unclothed,CR,EBS,1100
Until,IC,EBS,1200

我们的awk解决方案:

$ sort -t, -k3 -k4 filename.txt |
column -ts, |
awk '
# print our header line once, before processing the actual data file:
BEGIN { printf "%-11s %-8s %-20s %-14s\n" , "Family Name", "Initials", "Interviewer Initials", "Interview Time" }

# now process our data file:
{ # if interviewer has changed, print an empty line and make note of our new last_interviewer:
  if ( last_interviewer != $3 ) 
     { printf "\n" ; last_interviewer=$3 }

  # print our current data line:
  printf "%-11s %-8s %-20s %04d\n", $1,$2,$3,$4
}'

Family Name Initials Interviewer Initials Interview Time

Consoling   ST       DWC                  0900
Scribing    RA       DWC                  1000
Gloater     AU       DWC                  1100
Crimp       DI       DWC                  1200
Moving      TI       DWC                  1300

Handbook    EN       EBS                  0900
Stifling    BA       EBS                  1000
Unclothed   CR       EBS                  1100
Until       IC       EBS                  1200

注意:更新了几种printf格式以解决更宽的标题,并在第4列左侧填充零。