正则表达式和字符大小写

时间:2010-08-12 07:45:28

标签: regex perl casing

好的,我有一个相当简单的(至少看起来很简单)。我有一个多线的字符串,我只是在用其他东西替换不同的单词。让我告诉你......

#!/usr/bin/perl -w
use strict;

$_ = "That is my coat.\nCoats are very expensive.";
s/coat/Hat/igm;
print;

输出为
That is my Hat
Hats are very expensive...

第一行的“帽子”不应大写。是否有任何技巧可以使套管符合英语的编写方式?谢谢:))

4 个答案:

答案 0 :(得分:5)

答案 1 :(得分:4)

您可以使用e修饰符s///来执行此操作:

s/(coat)/ucfirst($1) eq $1 ? 'Hat' : 'hat'/igme;

答案 2 :(得分:1)

首先,您应该使用\b(单词边界)来匹配整个单词。例如,s/hat/coat/会将That更改为Tcoat而不会引导\b。现在提出你的问题。使用标志/e,您可以在正则表达式的替换部分中使用Perl代码。因此,您可以编写一个Perl函数来检查匹配的大小写,然后正确设置替换的大小写:

my $s = "That is my coat.\nCoats are very expensive.";
$s =~ s/(\bcoat)/&same_case($1, "hat")/igme;
print $s, "\n";

sub same_case {
        my ($match, $replacement) = @_;

        # if match starts with uppercase character, apply ucfirst to replacement
        if($match =~ /^[A-Z]/) {
                return ucfirst($replacement);
        }
        else {
                return $replacement;
        }
}

打印:

That is my hat.
Hats are very expensive.

答案 3 :(得分:0)

这可以解决您的问题:


#!/usr/bin/perl -w

use strict;

sub smartSubstitute {
    my $target = shift;
    my $pattern = shift;
    my $replacement = shift;

    $pattern = ucfirst $pattern;
    $replacement = ucfirst $replacement;

    $target =~ s/$pattern/$replacement/gm;

    $pattern = lcfirst $pattern;
    $replacement = lcfirst $replacement;

    $target =~ s/$pattern/$replacement/gm;

    return $target;
}

my $x = "That is my coat.\nCoats are very expansive.";
my $y = smartSubstitute($x, "coat", "Hat");
print $y, "\n";