我正在为Java的一个子集创建语法,并且似乎遇到了一个问题,其中我的语法被ANTLR认为是模棱两可的(或者,我认为这至少是错误背后的原因)。
这是我语法的相关部分:
expr : (op=MINUS | op=NOT) expr exprPrime -> ^(Expr $op expr exprPrime)
| NEW ID OPEN_PAREN CLOSE_PAREN exprPrime -> ^(Expr NEW ID OPEN_PAREN CLOSE_PAREN exprPrime)
| ID exprPrime -> ^(Expr ID exprPrime)
| THIS exprPrime -> ^(Expr THIS exprPrime)
| INTEGER exprPrime -> ^(Expr INTEGER exprPrime)
| NULL exprPrime -> ^(Expr NULL exprPrime)
| TRUE exprPrime -> ^(Expr TRUE exprPrime)
| FALSE exprPrime -> ^(Expr FALSE exprPrime)
| OPEN_PAREN expr CLOSE_PAREN exprPrime -> ^(Expr OPEN_PAREN expr CLOSE_PAREN exprPrime)
;
// remove left recursion via exprPrime
exprPrime : (op=PLUS | op=MINUS | op=MULT | op=DIV | op=LT | op=LEQ | op=GT | op=GEQ | op=EQ | op=NEQ | op=AND | op=OR | op=NOT) expr exprPrime
-> ^(ExprPrime $op expr exprPrime)
| DOT ID OPEN_PAREN (expr (COMMA expr)*)? CLOSE_PAREN exprPrime
-> ^(ExprPrime DOT ID OPEN_PAREN (expr (COMMA expr)*)? CLOSE_PAREN exprPrime)
|
-> ^(Epsilon)
;
/*------------------------------------------------------------------
* LEXER RULES
*------------------------------------------------------------------*/
CLASS : 'class' ;
PUBLIC : 'public' ;
STATIC : 'static' ;
EXTENDS : 'extends' ;
NEW : 'new' ;
THIS : 'this' ;
NULL : 'null' ;
IF : 'if' ;
ELSE : 'else' ;
WHILE : 'while' ;
MAIN : 'main' ;
TRUE : 'true' ;
FALSE : 'false' ;
RETURN : 'return' ;
SYSO : 'System.out.println' ;
OPEN_BRACKET : '{' ;
CLOSE_BRACKET : '}' ;
OPEN_SQUARE : '[' ;
CLOSE_SQUARE : ']' ;
OPEN_PAREN : '(' ;
CLOSE_PAREN : ')' ;
ASSIGN : '=' ;
COMMA : ',' ;
DOT : '.' ;
SEMICOLON : ';' ;
STRING_TYPE : 'String' ;
INTEGER_TYPE : 'int' ;
VOID_TYPE : 'void' ;
BOOLEAN_TYPE : 'boolean' ;
PLUS : '+' ;
MINUS : '-' ;
MULT : '*' ;
DIV : '/' ;
LT : '<' ;
LEQ : '<=' ;
GT : '>' ;
GEQ : '>=' ;
EQ : '==' ;
NEQ : '!=' ;
AND : '&&' ;
OR : '||' ;
NOT : '!' ;
ID : LETTER (LETTER | DIGIT)* ;
INTEGER : (NON_ZERO_DIGIT DIGIT*) | ZERO ;
WHITESPACE : ('\t' | ' ' | '\r' | '\n'| '\u000C')+ { $channel = HIDDEN; } ;
COMMENT : (('/*' .* '*/') | ('//' .* '\n')) { $channel = HIDDEN; } ;
fragment ZERO : '0' ;
fragment DIGIT : '0'..'9' ;
fragment NON_ZERO_DIGIT : '1'..'9' ;
fragment LETTER : 'a'..'z' | 'A'..'Z' ;
这是我在创建语法时遇到的错误:
warning(200): MiniJava.g:101:11: Decision can match input such as "NEQ" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "EQ" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "MULT" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "DIV" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "GEQ" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "NOT" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "LT" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "LEQ" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "DOT" using multiple alternatives: 2, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "OR" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "PLUS" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "AND" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "MINUS" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
warning(200): MiniJava.g:101:11: Decision can match input such as "GT" using multiple alternatives: 1, 3
As a result, alternative(s) 3 were disabled for that input
行号都指向exprPrime定义的行(为了简洁,我遗漏了大部分的gramar,让我知道是否需要发布更多的gramar)。创建exprPrime本身是为了避免'expr'中的左递归。
知道如何更改语法以消除歧义吗?我不确定我是否理解含糊不清的内容。
答案 0 :(得分:3)
你的语法可以最小化,以显示它的含糊不清:
grammar T; // line 1
// 2
expr // 3
: MINUS expr exprPrime // 4
; // 5
// 6
exprPrime // 7
: MINUS expr exprPrime // 8
| // epsilon // 9
; // 10
// 11
MINUS : '-'; // 12
INTEGER : '0'..'9'+; // 13
从此语法生成解析器将导致以下警告:
[20:38:15] warning(200): T.g:8:2:
Decision can match input such as "MINUS" using multiple alternatives: 1, 2
As a result, alternative(s) 2 were disabled for that input
[20:38:15] error(201): T.g:8:2: The following alternatives can never be matched: 2
第一部分表示在语法的第8行,生成的解析器可以同时使用两个“路径”来匹配令牌MINUS
(第8行和第9行的备选项都匹配MINUS
!) 。这是模棱两可的(在你的语法中,有很多甚至更多)。第二部分告诉您,由于这种歧义,生成的解析器将无法采用第二种替代方法(该路径被禁用)。
另一件事:通过匹配同一替代方案中的所有运算符,您无法使*
具有比-
更高的优先级。所有正确的递归制作都使语法难以理解。我会做这样的事情:
// ...
tokens {
UNARY_MINUS;
PARAMS;
INVOKE;
}
parse
: expr EOF
;
expr
: or_expr
;
or_expr
: and_expr (OR^ and_expr)*
;
and_expr
: rel_expr (AND^ rel_expr)*
;
rel_expr
: eq_expr ((LT | GT | LEQ | GEQ)^ eq_expr)?
;
eq_expr
: add_expr ((EQ | NEQ)^ add_expr)?
;
add_expr
: mult_expr ((PLUS | MINUS)^ mult_expr)*
;
mult_expr
: unary_expr ((MULT | DIV)^ unary_expr)*
;
unary_expr
: MINUS atom -> ^(UNARY_MINUS atom)
| atom
;
atom
: NEW ID OPEN_PAREN params CLOSE_PAREN -> ^(NEW ID params)
| (ID -> ID) (invoke -> ^(ID invoke))?
| THIS
| INTEGER
| NULL
| TRUE
| FALSE
| OPEN_PAREN expr CLOSE_PAREN -> expr
;
invoke
: DOT ID OPEN_PAREN params CLOSE_PAREN invoke? -> ^(INVOKE ID params invoke?)
;
params
: (expr (COMMA expr)*)? -> ^(PARAMS expr*)
;
// ...
没有歧义,并且所有运算符都具有适当的优先级(OR
是最低的,而一元运算符最高(原子甚至更高,当然......))。使用"x.foo(9,y)&&(1==2||true)+2+3+4==333"
规则解析expr
之类的输入将导致以下AST:
答案 1 :(得分:0)
模糊性在于exprPrime
的第三种选择将匹配任何 - 空规则总是成功并且永远不会推进输入。要消除歧义,您可以从定义中删除该备选项,并将所有exprPrime
替换为exprPrime?
。