Question

以下是我的数据的一部分，我需要从行中删除新的行字符，不包括以匹配格式|HH:MM:SS的序列结尾的行。

以下是前两个记录。第一个记录以＆＃34;重置系统＆＃34;开始第二个是＆＃34; Collaborator通知＆＃34;。

Reset system password SISMED WE|Collaborator requests password reset of SISMED WEB system.
Login: John Doe
Nome: Jackie
Locat: D. XYZ – UA ABC Al
Setor/Depto: Administration
Floor: 1st
Tel./Ramal: 358-108|14/01/2015                    |11:23:22
Collaborator informs that he can not open archiv ... |Collaborator informs you that you can not open files
Path: \\abc\def\ghi\jkl\mno
File: ESCALAS.xls

Name: Hutch cock
Locat: D. Al Mo
Setor/Depto: Hos
Floor: 2nd
Tel./Ramal: 1521
IP: 1.5.2.14|14/01/2015                    |11:26:21

我需要输出下面的东西

Reset system password SISMED WE|Collaborator requests password reset of SISMED WEB system.Login: John Doe Nome: Jackie Locat: D. XYZ – UA ABC Al Setor/Depto: Administration Floor: 1st Tel./Ramal: 358-108|14/01/2015                    |11:23:22
Collaborator informs that he can not open archiv ... |Collaborator informs you that you can not open files Path: \\abc\def\ghi\jkl\mno File: ESCALAS.xls Name: Hutch cock Locat: D. Al Mo Setor/Depto: Hos Floor: 2nd Tel./Ramal: 1521 IP: 1.5.2.14|14/01/2015                    |11:26:21

有些人可以帮我解决UNIX命令。

谢谢。

Answer 1

在原生bash中，旨在提供简洁性的可读性：

#!/usr/bin/env bash

# if we were passed a filename as an argument, read from that file
# otherwise, this script reads from stdin
[[ $1 ]] && exec <"$1"

# ERE-syntax regex matching end-of-record marker
end_of_record_re='[|][[:digit:]]{2}:[[:digit:]]{2}:[[:digit:]]{2}[[:space:]]*$'

buffer=''                       # start out with an empty buffer
while IFS= read -r line; do     # while we can, read a line.
  if ! [[ $line =~ $end_of_record_re ]]; then # unless it has an end marker...
    buffer+=" $line"                          # ...add to our buffer, preceded by a space
  else                                 # if the line has an end marker...
    printf '%s\n' "${buffer# }${line}" # ...print buffer except for first space
    buffer=                            # ...and reset the buffer to be empty
  fi
done

# finally, if we have trailing content, print it out.
[[ $buffer ]] && printf '%s\n' "${buffer# }"

合并行直到看到记录结束标记

1 个答案: