我有一个格式为的文件:
- foo bar - baz
one two three - or four
and another line
- next job
do this - and that
我的语法是
grammar tasks {
regex TOP { \n* <oneTask>+ \n* }
regex oneTask { ^^ \- (<oneSection> <endSection>)+ }
regex oneSection { \N+ } # this is not quite working
regex endSection { \n+ }
}
在正则表达式oneSection中,我如何编写“我想匹配' - '仅在它不在一行开头时”的事实?
我把文件放入一个字符串并解析这个字符串:
my $content = slurp("taskFile");
my $result = tasks.parse($content);
这不太合适。
<[\N] - [\-]> does not make the match conditional.
谢谢!
答案 0 :(得分:6)
更容易放下想要匹配的内容而不是尝试排除某些内容。
你要找的是一行中的一个字符,它不是换行符或短划线,后跟任意数量的非换行符。或者您正在寻找至少一个不是新行之后的换行符。
regex oneSection {
|| ^^ # beginning of line
<-[\n-]> # not newline or dash
\N* # any number of not newlines
|| <!before ^^> # check the position before this is not the start of a line
\N+
}
(这很复杂,因为你试图将复杂性放在语法中的错误位置)
您也可以像当前一样进行匹配,如果以-
开头,则添加失败的测试。
regex oneSection {
\N+
<!{ # fail if the following is True
$/.starts-with('-')
}>
}
语法是一种类,正则表达式/标记/规则是一种方法。所以你应该通过添加换行符和注释来这样写。
如果您学习如何使用%
和%%
正则表达式运算符,那么编写语法会更好。
(差异为%%
可以匹配尾随分隔符)
有效地使用%
可能需要一些时间来适应,所以我将向您展示如何使用它来匹配您的文件。
我还将部分的分隔符从仅换行更改为换行符和两个空格。这将删除section
匹配的空格,这将简化任何进一步处理。
在学习的过程中,我建议使用Grammar :: Debugger和Grammar :: Tracer。
grammar Tasks {
# use token for its :ratchet behaviour
# ( more performant than regex because it doesn't backtrack )
token TOP {
\n* # ignore any preceding empty lines
<task>+ # at least one task
% # separated by
\n+ # at least one newline
\n* # ignore trailing empty lines
}
token task {
^^ '- ' # a task starts with 「- 」 at the beginning of a line
<section>+ # has at least one section
% # separated by
"\n " # a newline and two spaces
}
token section { \N+ }
}
my $test = q:to/END/;
- foo bar - baz
one two three - or four
and another line
- next job
do this - and that
END
put Tasks.parse( $test, :actions(class {
method TOP ($/) { make @<task>».made.List }
method task ($/) { make @<section>».made.List }
method section ($/) {
make ~$/ # don't do any processing, just make it a Str
}
})).made.perl;
# (("foo bar - baz", "one two three - or four", "and another line"),
# ("next job", "do this - and that"))
如果我将use Grammar::Tracer;
放在顶部,那么它就会输出:
TOP
| task
| | section
| | * MATCH "foo bar - baz"
| | section
| | * MATCH "one two three - or four"
| | section
| | * MATCH "and another line"
| * MATCH "- foo bar - baz\n one two three - or four\n and another l"
| task
| | section
| | * MATCH "next job"
| | section
| | * MATCH "do this - and that"
| * MATCH "- next job\n do this - and that"
| task
| * FAIL
* MATCH "- foo bar - baz\n one two three - or four\n and another line"
预期FAIL
是因为有一个尾随的换行符,并且语法知道后可以跟一个任务。
答案 1 :(得分:0)
匹配任何不是字符串开头的内容,后跟短划线
[^$]-