Question

我是ANTLR的菜鸟。我正在为一种简单的语言构建编译器，但是我不明白为什么，如果我输入错误的键盘，编译器不会给我正确的错误。

那是我的语法：

            grammar Exercise;

            block  : '{' statement* '}';

            statement : assignment ';' 
                      | deletion ';' 
                      | print ';'
                      | declaration* ';'
                      | ifStat
                      | functionDecl
                      | exp
                      | block+
                      ;


            assignment : ID '=' exp;

            type  : 'int'
                  | 'boolean'
                  ;

            typeF  : 'void' ;


            declaration : type ID ;

            deletion : 'delete' ID;

            print  : 'print' exp;

            bool  : 'true' 
                  | 'false' 
                  ;

            exp : '(' exp ')'   
                | ID '(' expList? ')'
                | NUMBER 
                | bool
                | ID
                ;

            expIF   : ID EQ ID
                    | ID EQ bool
                    | ID GT ID
                    | ID LT ID
                    | ID NEQ ID 
                    | ID GTEQ ID
                    | ID LTEQ ID
                    | NOT ID
                    | ID 
                    ;

            ifStat  : 'if' '('expIF')' 'then' block ('else' block)? ;

            formalParameter  : declaration 
                             | rif declaration
                             ;

            rif : 'var';

            formalParameters    : formalParameter (',' formalParameter)* ; 

            functionDecl    : typeF ID LPAR formalParameters? RPAR block ; 

            expList : ID (',' ID )* ; 

            //IDs
            fragment CHAR  : 'a'..'z' |'A'..'Z' ;
            ID              : (CHAR)+ | (DIGIT)+ ;

            //Numbers
            fragment DIGIT : '0'..'9'; 
            NUMBER          : DIGIT+;

            OR : '||';
            AND : '&&';
            NOT : '!';
            EQ : '==';
            NEQ : '!=';
            GT : '>';
            LT : '<';
            GTEQ : '>=';
            LTEQ : '<=';
            LPAR : '(';
            RPAR : ')';

            //ESCAPE SEQUENCES
            WS              : (' '|'\t'|'\n'|'\r')-> skip;
            LINECOMMENTS  : '//' (~('\n'|'\r'))* -> skip;
            BLOCKCOMMENTS    : '/*'( ~('/'|'*')|'/'~'*'|'*'~'/'|BLOCKCOMMENTS)* '*/' -> skip;
            ERR: . -> channel(HIDDEN);

这是我的主音

   public static void main(String[] args) {

   //create lexer

       ExerciseLexer lexer = new ExerciseLexer(new ANTLRInputStream("{ double a ;  boolean d; a = 4 ; {boolean d ; int a} int s;}") );

   //create parser
       CommonTokenStream tokens = new CommonTokenStream(lexer);
       ExerciseParser parser = new ExerciseParser(tokens);

   //tell the parser to build the AST
        parser.setBuildParseTree(true);

    //build custom visitor
        ExerciseVisitorImpl visitor = new ExerciseVisitorImpl();
        ParseTree pt = parser.block();
        visitor.visit(pt);

例如，在这种情况下，我应该为“ double”关键字得到一个错误，但是我得到“第1:51行多余的输入'}'，期望为{'boolean'，';'，'int'}” 。问题是什么？非常感谢！

Answer 1

在您的语法中，statement是exp。您可能是说exp ';'。

按照书面规定，block是statement*，可以匹配exp exp。由于ID是exp，而double和a都是ID，因此double a被识别为两个连续的statement

此外，您的语法将declaration* ';'识别为statement。由于declaration*包含零声明（即空字符串）的情况，因此，单独的;匹配该产生式。我不知道那是否真的是您想要的，但是我强烈怀疑您不希望在不使用分号将两个连续的declaration分开的情况下进行匹配。

ANTLR无法识别错误的关键字

1 个答案: