如何让ANTLR 3.2在第一次出错时退出?

时间:2010-03-15 03:58:25

标签: error-handling antlr

在10.4节中,最终的ANTLR参考指示您覆盖不匹配()&如果要在第一次解析错误时退出,请使用recoverFromMismatchedSet()。但是,至少在ANTLR 3.2中,似乎没有mismatch()方法,并且recoverFromMismatchedSet()文档说它是“当前未使用”。因此,自该书出版以来,情况似乎发生了变化。

我应该做什么而不是退出ANTLR 3.2中的第一个解析错误?

1 个答案:

答案 0 :(得分:13)

我将此问题发布给了anltr-interest,Andrew Haritonkin回答道。巴特K是对的;你需要覆盖recoverFromMismatchedSet(),还需要recoverFromMismatchedToken()。

如果您还希望词法分析器在第一个错误时退出,则会有一个Wiki页面解释该怎么做:

http://www.antlr.org/wiki/pages/viewpage.action?pageId=5341217

简而言之,它声明:

  1. 如果你想抛出RecognitionException(或从Exception继承的任何东西)那么你必须做鬼鬼祟祟的java技巧,因为相关方法没有声明任何异常
  2. 如果可以抛出RuntimeException或Error,那么你可以覆盖nextToken()来抛出异常而不是调用recoverError(),或者你可以覆盖recoverError()。抛出异常。
  3. 这是一个在第一个词法分析器或解析器错误时退出的示例语法:

    grammar Test;
    
    @parser::members {
    
      public static void main(String[] args) throws Exception {
        String text = args[0];
        ANTLRStringStream in = new ANTLRStringStream(text);
        TestLexer lexer = new TestLexer(in);
        CommonTokenStream tokens = new CommonTokenStream(lexer);
        System.out.println(new TestParser(tokens).mainRule());
      }
    
      @Override
      protected Object recoverFromMismatchedToken(IntStream input, int ttype, BitSet follow) throws RecognitionException {
        throw new MismatchedTokenException(ttype, input);
      }
    
      @Override
      public Object recoverFromMismatchedSet(IntStream input, RecognitionException e, BitSet follow) throws RecognitionException {
        throw e;
      }
    
    }
    
    @rulecatch {
        catch (RecognitionException e) {
            throw e;
        }
    }
    
    @lexer::members {
        @Override
        public void reportError(RecognitionException e) {
            throw new RuntimeException(e);
        }
    
    }    
    
    mainRule returns [List<String> words]
        @init{$words = new ArrayList<String>();}
      :  w=Atom {$words.add($w.text);} (',' w=Atom {$words.add($w.text);} )* EOF
      ;
    
    
    Atom: '0' | '1';
    
    WS  :  ( '\t' | ' ' | '\r' | '\n'| '\u000C' )+ { $channel = HIDDEN; } ;
    

    示例输出:

    C:\Users\dan\workspace\antlrtest>java -cp .;antlr-3.2.jar TestParser "1,0" 
    [1, 0]
    
    C:\Users\dan\workspace\antlrtest>java -cp .;antlr-3.2.jar TestParser "1,,0"
    Exception in thread "main" MismatchedTokenException(6!=4)
            at TestParser.recoverFromMismatchedToken(TestParser.java:45)
            at org.antlr.runtime.BaseRecognizer.match(BaseRecognizer.java:115)
            at TestParser.mainRule(TestParser.java:86)
            at TestParser.main(TestParser.java:40)
    
    C:\Users\dan\workspace\antlrtest>java -cp .;antlr-3.2.jar TestParser "1,+0"   
    Exception in thread "main" java.lang.RuntimeException: NoViableAltException('+@[])                                                                
            at TestLexer.reportError(TestLexer.java:16)                           
            at org.antlr.runtime.Lexer.nextToken(Lexer.java:94)                   
            at org.antlr.runtime.CommonTokenStream.fillBuffer(CommonTokenStream.java:119)                             at org.antlr.runtime.CommonTokenStream.LT(CommonTokenStream.java:238) 
            at org.antlr.runtime.Parser.getCurrentInputSymbol(Parser.java:54)     
            at org.antlr.runtime.BaseRecognizer.match(BaseRecognizer.java:104)    
            at TestParser.mainRule(TestParser.java:68)                            
            at TestParser.main(TestParser.java:40)                                
    Caused by: NoViableAltException('+'@[])                                       
            at TestLexer.mTokens(TestLexer.java:165)                              
            at org.antlr.runtime.Lexer.nextToken(Lexer.java:84)
            ... 6 more