之前我创建了this question,询问如何使用ANTLR 4创建if / else语句。我得到了一个很好的答案,它也展示了如何做while循环。我已经用我的语言实现了这一点,现在我正在尝试使用几乎相同的原则进行do-while循环。
my循环的语法如下:
count is 0
while count is less than 10
count+
if count not equals 10
write " " + count + ": Getting there..."
else if count equals 10
write count + ": The end!"
end if
end while
这就是我想做的do-while循环:
count is 0
do
count+
write "count is " + count
if count equals 10
write "The end!"
end if
while count is less than 10
我测试了它,但它们都有效,但不是在同一时间。下面是我的语法(很抱歉发布所有内容,但我认为这是必要的)。
如果我的WHILE
和END_WHILE
令牌高于我的DO_WHILE
和DO_WHILE_CONDITION
令牌,则while循环有效。但是,如果我将它们切换到我的do-while循环工作。如果我将DO_WHILE_CONDITION
令牌更改为而,那么两者都有效。
无论如何,我可以让它们都使用当前语法吗?我知道这可能是一个问题,因为我对多个事物使用相同的关键字,但我希望有一种方法可以做到这一点。
//////////////////////////////////
// PARSER
//////////////////////////////////
program
: block EOF
;
block
: (statement (NEW_LINE+ | EOF))*
;
statement
: assignment
| if_statement
| while_statement
| until_statement
| do_while_statement
| write
;
assignment
: ID ASSIGN expression # expressionAssignment
| ID PLUS # incrementAssignment
| ID MINUS # decrementAssignment
;
if_statement
: IF condition_block (ELSE_IF condition_block)* (ELSE NEW_LINE statement_block)? END_IF
;
condition_block
: expression NEW_LINE statement_block
;
statement_block
: block
;
while_statement
: WHILE expression NEW_LINE statement_block END_WHILE
;
until_statement
: UNTIL expression NEW_LINE statement_block END_UNTIL
;
do_while_statement
: DO_WHILE NEW_LINE statement_block DO_WHILE_CONDITION expression
;
expression
: atom # atomExpression
| expression PLUS expression # plusExpression
| expression MINUS expression # minusExpression
| expression MULTIPLY expression # multiplicationExpression
| expression DIVIDE expression # divisionExpression
| expression PLUS # incrementExpression
| expression MINUS # decrementExpression
| expression AND expression # andExpression
| expression OR expression # orExpression
| expression EQUALS expression # equalityExpression
| expression NOT_EQUALS expression # notEqualityExpression
| expression LESS_THAN expression # lessThanExpression
| expression NOT_LESS_THAN expression # notLessThanExpression
| expression GREATER_THAN expression # greaterThanExpression
| expression NOT_GREATER_THAN expression # notGreaterThanExpression
| expression GREATER_THAN_OR_EQUAL expression # greaterThanOrEqualExpression
| expression LESS_THAN_OR_EQUAL expression # lessThanOrEqualExpression
;
atom
: INT # integerAtom
| FLOAT # floatAtom
| BOOLEAN # boolAtom
| ID # idAtom
| STRING # stringAtom
| OPEN_PAR expression CLOSE_PAR # expressionAtom
;
write
: WRITE expression
;
//////////////////////////////////
// LEXER
//////////////////////////////////
PLUS : '+';
MINUS : '-';
MULTIPLY : '*';
DIVIDE : '/';
ASSIGN : 'is';
OPEN_CURLY : '{';
CLOSE_CURLY : '}';
OPEN_PAR : '(';
CLOSE_PAR : ')';
COLON : ':';
NEW_LINE : '\r'? '\n';
IF : 'if';
ELSE_IF : 'else if';
ELSE : 'else';
END_IF : 'end if';
WHILE : 'while';
END_WHILE : 'end while';
UNTIL : 'until';
END_UNTIL : 'end until';
DO_WHILE : 'do';
DO_WHILE_CONDITION : 'while';
EQUALS : 'equals';
NOT_EQUALS : 'not equals';
LESS_THAN : 'is less than';
NOT_LESS_THAN : 'is not less than';
GREATER_THAN : 'is greater than';
NOT_GREATER_THAN : 'is not greater than';
GREATER_THAN_OR_EQUAL : 'is greater than or equals';
LESS_THAN_OR_EQUAL : 'is less than or equals';
WRITE : 'write';
AND : 'and';
OR : 'or';
NOT : 'not';
BOOLEAN
: 'TRUE' | 'true' | 'YES' | 'yes'
| 'FALSE' | 'false' | 'NO' | 'no'
;
INT
: (PLUS | MINUS)? NUMBER+
;
FLOAT
: (PLUS | MINUS)? NUMBER+ ('.' | ',') (NUMBER+)?
| (PLUS | MINUS)? (NUMBER+)? ('.' | ',') NUMBER+
;
NUMBER
: '0'..'9'
;
STRING
: '"' ( '\\"' | ~["] )* '"'
;
ID
: ('a'..'z' | 'A'..'Z' | '0'..'9')+
;
WHITESPACE
: [ \t]+ -> skip
;
COMMENT
: ( ';;' .*? ';;' | ';' ~[\r\n]* ) -> skip
;
答案 0 :(得分:1)
创建令牌时,词法分析器不会考虑解析器在某一点可能需要的内容。查看描述规则的Q& A(适用于v3和v4):Antlr v3 error with parser/lexer rules
这意味着在您的情况下,规则DO_WHILE_CONDITION
:
WHILE : 'while';
...
DO_WHILE_CONDITION : 'while';
永远不会匹配。
除此之外,用白色空格“粘合”关键字通常不是一个好主意。考虑何时输入为"end if"
(2个空格)。最好创建2个令牌:END
和IF
,并在解析器规则中使用这些令牌。
尝试这样的事情:
program
: block
;
block
: NEW_LINE* (statement (NEW_LINE+ | EOF))*
;
statement
: assignment
| if_statement
| while_statement
| until_statement
| do_while_statement
| write
;
assignment
: ID IS expression # expressionAssignment
| ID PLUS # incrementAssignment
| ID MINUS # decrementAssignment
;
if_statement
: IF condition_block (ELSE IF condition_block)* (ELSE NEW_LINE statement_block)? END IF
;
condition_block
: expression NEW_LINE statement_block
;
statement_block
: block
;
while_statement
: WHILE expression NEW_LINE statement_block END WHILE
;
until_statement
: UNTIL expression NEW_LINE statement_block END UNTIL
;
do_while_statement
: DO NEW_LINE statement_block WHILE expression
;
// Added unary expressions instead of combining them in the lexer.
expression
: atom # atomExpression
| MINUS expression # unaryMinusExpression
| PLUS expression # unaryPlusExpression
| expression PLUS expression # plusExpression
| expression MINUS expression # minusExpression
| expression MULTIPLY expression # multiplicationExpression
| expression DIVIDE expression # divisionExpression
| expression PLUS # incrementExpression
| expression MINUS # decrementExpression
| expression AND expression # andExpression
| expression OR expression # orExpression
| expression EQUALS expression # equalityExpression
| expression NOT EQUALS expression # notEqualityExpression
| expression IS LESS THAN expression # lessThanExpression
| expression IS NOT LESS THAN expression # notLessThanExpression
| expression IS GREATER THAN expression # greaterThanExpression
| expression IS NOT GREATER THAN expression # notGreaterThanExpression
| expression IS GREATER THAN OR EQUALS expression # greaterThanOrEqualExpression
| expression IS LESS THAN OR EQUALS expression # lessThanOrEqualExpression
;
atom
: INT # integerAtom
| FLOAT # floatAtom
| bool # boolAtom
| ID # idAtom
| STRING # stringAtom
| OPEN_PAR expression CLOSE_PAR # expressionAtom
;
write
: WRITE expression
;
// By making this a parser rule, you needn't inspect the lexer rule
// to see if it's true or false.
bool
: TRUE
| FALSE
;
//////////////////////////////////
// LEXER
//////////////////////////////////
PLUS : '+';
MINUS : '-';
MULTIPLY : '*';
DIVIDE : '/';
OPEN_CURLY : '{';
CLOSE_CURLY : '}';
OPEN_PAR : '(';
CLOSE_PAR : ')';
COLON : ':';
NEW_LINE : '\r'? '\n';
IF : 'if';
ELSE : 'else';
END : 'end';
WHILE : 'while';
UNTIL : 'until';
DO : 'do';
EQUALS : 'equals';
NOT : 'not';
IS : 'is';
LESS : 'less';
THAN : 'than';
GREATER : 'greater';
WRITE : 'write';
AND : 'and';
OR : 'or';
TRUE : 'TRUE' | 'true' | 'YES' | 'yes';
FALSE : 'FALSE' | 'false' | 'NO' | 'no';
INT
: DIGIT+
;
// (DIGIT+)? is the same as: DIGIT*
FLOAT
: DIGIT+ [.,] DIGIT*
| DIGIT* [.,] DIGIT+
;
// If a rule can never become a token on its own (an INT will always
// be created instead of a DIGIT), mark it as a 'fragment'.
fragment DIGIT
: [0-9]
;
// Added support for escaped backslashes.
STRING
: '"' ( '\\"' | '\\\\' | ~["\\] )* '"'
;
// Can it start with a digit? Maybe this is better: [a-zA-Z] [a-zA-Z0-9]*
ID
: [a-zA-Z0-9]+
;
WHITESPACE
: [ \t]+ -> skip
;
COMMENT
: ( ';;' .*? ';;' | ';' ~[\r\n]* ) -> skip
;
哪个解析器同时构造没有问题。另请注意,我对您的语法稍作调整(请参阅内联注释)。一元表达式是一个重要的表达式,否则1-2
将被标记为2 INT
个标记,这些标记在解析器中无法与expression
匹配!