我必须编写一个shell / perl脚本来扫描日志文件最近30分钟的数据。要求是在Cron中安排此脚本每30分钟运行一次并查找错误字符串。
OS: Solaris
Shell:Bash
我已尝试过以下脚本,但它已经变得太长而且笨拙,我们还有其他办法让它缩短一点吗?
blogs=/opt/docs/datapower/prod/business.log
slogs=/opt/docs/datapower/prod/system.log
starttime=$(date +'%H')
currmin=$(date +'%M')
curdate=`date|cut -d' ' -f5`
echo $(date)
if [ $currmin -le 29 ] && [ $starttime -ne 00 ] ; then
starttime1=`echo "$(date +'%H') - 1" | bc`
logtime="$starttime1"
logtime="$logtime:[3-5][0-9]"
echo $logtime
elif [ $currmin -le 29 ] && [ $starttime -eq 00 ] ; then
logtime="23:[3-5][0-9]"
echo $logtime
else
logtime="$starttime"
logtime="$logtime:[0-2][0-9]"
echo $logtime
fi
if ( grep "$logtime" $slogs | egrep "AAA Authentication Failure|AAA Authorization Failure") > dptest 2>&1;then
Do something
fi
更新:添加示例日志语句。
以下是log语句的示例:
Nov 20 06:06:58 business-log-sta [DP-Domain-STAGING][0x80000001][business-log][info] mpgw(GenServiceMPG): trans(31513092)[request]: AAA Authentication failure/>
答案 0 :(得分:1)
我认为你正在做一些倒退 - 建立一个RE来从日志文件中提取日期。
在perl中接近这个我想要阅读整个日志文件,标记它 - 提取时间戳 - 然后根据消息内容发出警报。
Perl为第一部分提供了一个很好的模块 - Time::Piece
。
它有点像这样:
use strict;
use warnings;
use Time::Piece;
my $HALF_HOUR = 30 * 60;
while (<DATA>) {
#extract timestamp via regular expression
my ( $timestamp, $message ) = (m/\A(\w+\s+\d+\s+\d+:\d+:\d+) (.*)/);
#convert text timestamp to 'unix time'.
#need the year in here because your log doesn't include it.
my $t = localtime();
$t = $t->strptime( $timestamp . " " . $t->year, "%b %d %H:%M:%S %Y" );
#skip if parsed time is more than half an hour ago.
next if ( $t < time() - $HALF_HOUR );
if ( $message =~ m/AAA Authentication failure/i
or $message =~ m/AAA Authorization failure/i )
{
print "Alert: ( $t ) $message\n";
}
}
__DATA__
Nov 20 13:46:58 business-log-sta [DP-Domain-STAGING][0x80000001][business-log][info] mpgw(GenServiceMPG): trans(31513092)[request]: AAA Authentication failure/>
Nov 20 13:00:58 business-log-sta [DP-Domain-STAGING][0x80000001][business-log][info] mpgw(GenServiceMPG): trans(31513092)[request]: AAA Authentication failure/>
Nov 20 10:06:58 business-log-sta [DP-Domain-STAGING][0x80000001][business-log][info] mpgw(GenServiceMPG): trans(31513092)[request]: AAA Authentication failure/>
后续问:
“能否解释一下这句话的作用,my ( $timestamp, $message ) = (m/\A(\w+\s+\d+\s+\d+:\d+:\d+) (.*)/);
”
这有两件事:
\A(\w+\s+\d+\s+\d+:\d+:\d+)
- 将从行首开始匹配:
\d+:\d+:\d+
会抓住时间。 (任何3个冒号分隔的数字)。 当然,另一部分捕获了“其余部分”。
$timestamp
和$message
)。 净结果是 - 给定行:
Nov 20 13:46:58 business-log-sta [DP-Domain-STAGING][0x80000001][business-log][info] mpgw(GenServiceMPG): trans(31513092)[request]: AAA Authentication failure/>
(\w+ \d+ \d+:\d+:\d+) (.*)
我们的正则表达式分别返回两个“块”,然后我们将它们放入两个变量中。
答案 1 :(得分:0)
你如何使用sqlite3来做过滤 - 你解析时间的好处可能非常方便。 唯一的背后是你必须规范化数据。
function sqlite-filter-time() {
if [ '0' = "$#" ]; then
echo "Usage: $FUNCNAME <file> <timespan> <where>"
return
fi
local year="$(date '+%Y')"
local ofs='___FS___'
sed "s,^\([^ ]* [^ ]*\) \([^ ]*\),\1 \2$ofs," "$1" | sed "s,Jan ,$year-01-,;s,Feb ,$year-02-,;s,Mar ,$year-03-,;s,Apr ,$year-04-,;s,May ,$year-05-,;s,June ,$year-06-,;s,July ,$year-07-,;s,Aug ,$year-08-,;s,Sep ,$year-09-,;s,Oct ,$year-10-,;s,Nov ,$year-11-,;s,Dec ,$year-12-," > "$1.tmp" # normalize data for sqlite - command to extract the date and the rest of the text
{
echo '.mode csv'
echo 'DROP TABLE IF EXISTS sft;'
echo 'CREATE TEMPORARY TABLE sft ('
echo ' sft_date TEXT,'
echo ' sft_text TEXT'
echo ');'
echo ".headers off"
echo ".nullvalue ''"
echo ".separator '$ofs'"
echo ".import $1.tmp sft"
echo ".separator ' '"
echo "SELECT *"
echo "FROM sft"
echo "WHERE sft_date > datetime('now', '$2')"
echo " AND (sft_text like '%AAA Authentication Failure%'"
echo " OR sft_text like '%AAA Authorization Failure%'"
echo " )"
echo ";"
} | sqlite3
rm "$1.tmp"
}
$ sqlite-filter-time "$slogs" '-30 minutes'
"2014-11-20 16:01:58" " business-log-sta [DP-Domain-STAGING][0x80000001][business-log][info] mpgw(GenServiceMPG): trans(31513092)[request]: AAA Authentication failure/>"
$