如何在Ruby Treetop树的子节点中触发函数。 (是:如何防止红宝石Treetop进行AST压榨)

时间:2018-08-01 18:04:16

标签: ruby parsing treetop

我使用树梢已有一段时间了。我按照

编写了规则

http://thingsaaronmade.com/blog/a-quick-intro-to-writing-a-parser-using-treetop.html

我可以解析我的整个输入字符串,但是除了初始一个,其他所有to_array函数都不会被触发。

然后我发现https://whitequark.org/blog/2011/09/08/treetop-typical-errors/谈论AST squashing,我发现我的规则也是如此。

我的第一个规则是

  rule bodies
    blankLine* interesting:(body+) blankLine* <Bodies>
  end

body吞噬了一切。

有人可以建议我如何解决此问题吗?

修改 添加代码段:

grammar Sexp

  rule bodies
    blankLine* interesting:(body+) blankLine* <Bodies>
  end

  rule body
    commentPortString (ifdef_blocks / interface)+ (blankLine / end_of_file) <Body>
  end

  rule interface
    space? (intf / intfWithSize) space?  newLine <Interface>
  end

  rule commentPortString
    space? '//' space portString space?  <CommentPortString>
  end

  rule portString
    'Port' space? '.' newLine <PortString>
  end

  rule expression
    space? '(' body ')' space? <Expression>
  end

  rule intf
    (input / output) space wire:wireName space? ';' <Intf>
  end

  rule intfWithSize
    (input / output) space? width:ifWidth space? wire:wireName space? ';' <IntfWithSize>
  end

  rule input
    'input' <InputDir>
  end

  rule output
    'output' <OutputDir>
  end

  rule ifdef_blocks
    ifdef_line (interface / ifdef_block)* endif_line <IfdefBlocks>
  end

  rule ifdef_block
    ifdef_line interface* endif_line <IfdefBlocks>
  end

  rule ifdef_line
    space? (ifdef / ifndef) space+  allCaps space? newLine <IfdefLine>
  end

  rule endif_line
    space? (endif) space? newLine <EndifLine>
  end

  rule ifdef
    '`ifdef' <Ifdef>
  end

  rule ifndef
    '`ifndef' <Ifndef>
  end

  rule endif
    '`endif' <Endif>
  end

  rule ifWidth
    '[' space? msb:digits space? ':' space? lsb:digits ']' <IfWidth>
  end

  rule digits
    [0-9]+ <Digits>
  end

  rule integer
    ('+' / '-')? [0-9]+ <IntegerLiteral>
  end

  rule float
    ('+' / '-')? [0-9]+ (('.' [0-9]+) / ('e' [0-9]+)) <FloatLiteral>
  end

  rule string
    '"' ('\"' / !'"' .)* '"' <StringLiteral>
  end

  rule identifier
    [a-zA-Z\=\*] [a-zA-Z0-9_\=\*]* <Identifier>
  end

  rule allCaps
    [A-Z] [A-Z0-9_]*
  end

  rule wireName
    [a-zA-Z] [a-zA-Z0-9_]* <WireName>
  end

  rule non_space
    !space .
  end

  rule space
    [^\S\n]+
  end

  rule non_space
    !space .
  end

  rule blankLine
    space* newLine
  end

  rule not_newLine
    !newLine .
  end

  rule newLine
    [\n\r]
  end

  rule end_of_file
    !.
  end

end

测试字符串

// Port.
input         CLK;

// Port.
input         REFCLK;

// Port.
input [ 41:0] mem_power_ctrl;
output data;

编辑:添加更多详细信息

测试代码签入到: https://github.com/justrajdeep/treetop_ruby_issue

正如您在我的node_extensions.rb中所见,除Bodies以外的所有节点都在方法to_array中引发异常。但是没有异常触发。

2 个答案:

答案 0 :(得分:1)

您在to_array的{​​{1}}上呼叫tree。这是您曾经调用过Bodies的唯一方法,因此不会调用其他to_array方法。

如果要在to_array节点的子节点上调用to_array,则Bodies需要在这些子节点上调用Bodies#to_array。因此,如果要在标记为to_array的{​​{1}}节点上调用它,则应遍历Body并在每个元素上调用interesting

答案 1 :(得分:0)

尝试将(body+)分解成这样的新规则:

rule bodies
   blankLine* interesting:interesting blankLine* <Bodies>
end

rule interesting
   body+ <Interesting>
end

否则,查看SyntaxNode类将很有帮助。