使用Perl验证年份范围

时间:2016-05-02 17:13:38

标签: regex perl

这些是源文件标题中的有效年份,版权行:

2016
2007,2016
2007-2010,2016
2010,2012-2016

当前年份(2016年)应该是其中的一部分。

无效:

2016,2007 # not in order
2010-2007,2016 #range not in order.
2010-2015 # current year missing.

我制作了如下脚本,仍在使用它,有更好的方法来检查它吗?

use strict;
use warnings;
use List::Util 'first';
use Time::Piece;

my $t = Time::Piece->new();
my $currentYear=$t->year;

my $filename="sample.txt"; # this file name will be passed-in to the script.
my $fileContent=`cat $filename`;
my $copyrightFound = first { /copyright .* Shakespeare/i } $fileContent;
if (!$copyrightFound) {
     print "\nERROR: Copyright missing\n";
     exit;
}
#copyright Found
print $fileContent;
if ($fileContent =~ /Copyright \(c\) (.*) by Bill Shakespeare\s+.*All rights reserved./) {
    print "\nCopyright is good: $1\n";
    my $years = $1;
    print "\n$years, $currentYear\n";
    #e.g: 2016
    if ($years !~ ', ' && $years !~ '-') {
       print "\nerror\n" if ($years !~ /$currentYear/);
    }
    #e.g: 2010-2016
    elsif ($years !~ ', ') {
       print "\nError\n" if ($years !~ /\-$currentYear$/);
    }
    #eg: 2010, 2016
    elsif ($years !~ '-') {
       print "\nError\n" if ($years !~ /\, $currentYear$/);
    }
    #e.g: 2008, 2010, 2011-2016
    elsif ($years =~ '-' && $years =~ ', ') {
       print "\nError 5\n" if ($years !~ /\d\d\d\d, \d\d\d\d\-$currentYear$/);
    }
    else {
       print "invalid format"
    }

} else {
      print "\nCopyright needs to be fixed\n";
}

sample.txt有:

Copyright (c) 2008, 2010, 2011-2016 by Bill Shakespeare
All rights reserved.

1 个答案:

答案 0 :(得分:5)

您可以使用Set::IntSpan来验证日期:

use warnings;
use strict;
use Set::IntSpan qw();

my $currentYear = 2016;

while (<DATA>) {
    s/\s+//g;
    my $ok = 0;
    if (Set::IntSpan->valid($_)) { # Checks order
        my $set = Set::IntSpan->new($_);
        my @years = $set->elements();
        $ok = 1 if grep { $_ == $currentYear } @years;
    }
    print "$ok : $_\n";
}

__DATA__
2007
2007, 2016
2007-2010,2016
2010,2012-2016
2016,2007
2010-2007,2016
2010-2015

打印:

0 : 2007
1 : 2007,2016
1 : 2007-2010,2016
1 : 2010,2012-2016
0 : 2016,2007
0 : 2010-2007,2016
0 : 2010-2015

根据member在评论中的建议,使用grep而不是pilcrow进行简化:

my $ok = 0;
if (Set::IntSpan->valid($_)) { 
    my $set = Set::IntSpan->new($_);
    $ok = 1 if $set->member($currentYear);
}
print "$ok : $_\n";