Question

使用以下语法，我在这种输入中遇到语法错误：

ls /home > foo #Runs and works okay, but raises error token
ls /home /foo /bar /etc #works okay

我认为它可能与lookahead的工作方式有关，但这是我的第一个语法，我对它为什么不能这样工作有点困惑：external_cmd GT WORD是一个重定向，redirect是一个命令，命令是一个命令，因此输入命令NEWLINE应该可以工作。

语法的顶级规则：

input:
    error NEWLINE {
        printf("Error Triggered\n");
        yyclearin;
        yyerrok; 
        prompt(); 
    } |
    input NEWLINE {
        prompt();
    } | 
    input commands NEWLINE {
        prompt (); 
    } | 
    /* empty */
    ;   

commands: 
    command |   
    command SEMI | 
    command SEMI commands
    ;   

command:
    builtin_cmd |
    redirect |
    external_cmd { 
        execute_command($1, 0, NULL);
    }
    ;

redirect:
    external_cmd GT WORD  {
        printf("Redirecting stdout of %s to %s\n", $1->cmd, $3);
        //printf("DEBUG: GT\n");
        execute_command($1, STDOUT_FILENO, $3);
    }
    external_cmd LT WORD {
        printf("Redirecting stin of %s to %s\n", $1->cmd, $3);
        //printf("DEBUG: GT\n");
        execute_command($1, STDIN_FILENO, $3);
    }
    ;

引发错误令牌时的调试/详细输入：

Next token is token WORD ()
Shifting token WORD ()
Entering state 6
Reading a token: Next token is token WORD ()
Shifting token WORD ()
Entering state 24
Reading a token: Next token is token GT ()
Reducing stack by rule 22 (line 115):
   $1 = token WORD ()
-> $$ = nterm arg_list ()
Stack now 0 2 6
Entering state 26
Reducing stack by rule 19 (line 91):
   $1 = token WORD ()
   $2 = nterm arg_list ()
-> $$ = nterm external_cmd ()
Stack now 0 2
Entering state 16
Next token is token GT ()
Shifting token GT ()
Entering state 29
Reading a token: Next token is token WORD ()
Shifting token WORD ()
Entering state 33
Reducing stack by rule 11 (line 68):
Redirecting stdout of ls to foo
DEBUG: redirect mode is 1
DEBUG: Command to run is ls
DEBUG: Adding Argument /home
admin  kbrandt  tempuser
-> $$ = nterm @1 ()
Stack now 0 2 16 29 33
Entering state 34
Reading a token: Next token is token NEWLINE ()
syntax error
Error: popping nterm @1 ()
Stack now 0 2 16 29 33
Error: popping token WORD ()
Stack now 0 2 16 29
Error: popping token GT ()
Stack now 0 2 16
Error: popping nterm external_cmd ()
Stack now 0 2
Error: popping nterm input ()
Stack now 0
Shifting token error ()
Entering state 1
Next token is token NEWLINE ()
Shifting token NEWLINE ()
Entering state 3
Reducing stack by rule 1 (line 38):
   $1 = token error ()
   $2 = token NEWLINE ()
Error Triggered
-> $$ = nterm input ()
Stack now 0
Entering state 2

更新：
external_cmd是：

external_cmd:
    WORD arg_list {
        $$ = malloc( sizeof(struct ext_cmd) );
        if ( $$ == NULL)
            printf("Memory Allocation Error\n");
        $$->cmd = $1;
        $$->args_pp = $2;
    } |
    WORD    {
        $$ = malloc( sizeof(struct ext_cmd) );
        if ( $$ == NULL)
            printf("Memory Allocation Error\n");
        $$->cmd = $<string>1;
        $$->args_pp = NULL;
    }

Answer 1

语法错误来自您对第二次调用yyparse。当您具有重定向时，您的语法会执行YYACCEPT，这会导致解析器立即返回而不再读取任何内容。在第二次调用时，第一个令牌读取是NEWLINE，它会出现错误（您的语法不允许空行。）

没有重定向，没有YYACCEPT，所以语法继续运行，读取换行符并返回到输入结束。

Answer 2

你真的真的应该使用LALR（1）解析器生成器的左递归。右递归要求所有元素都转移到解析器状态堆栈上，甚至可以进行单次缩减。您可以想象这对错误恢复的影响。
究竟是什么external_cmd？它看起来有点像早期减少但很难分辨，因为你没有把它包括在内。
为什么在重定向后会调用YYACCEPT？如果您打算在每一行上重新启动解析器，那么您不应该使用递归输入收集器。只要你拥有它，就不要做YYACCEPT。

Answer 3

发现它，在我的重定向规则中有一个丢失的管道，所以有一个中间规则动作而不是两个组件，这不是我想要的。

关于野牛/ YACC语法的困惑

3 个答案: