获取yecc中的行号

时间:2018-01-24 07:02:09

标签: parsing erlang elixir yecc

我正在使用yecc来解析我的标记化asm式代码。在提供像"MOV [1], [2]\nJMP hello"这样的代码之后,在lexer'ing之后,这就是我得到的回应。

[{:opcode, 1, :MOV}, {:register, 1, 1}, {:",", 1}, {:register, 1, 2},
  {:opcode, 2, :JMP}, {:identifer, 2, :hello}]

当我解析这个时,我正在

[%{operation: [:MOV, [:REGISTER, 1], [:REGISTER, 2]]},
  %{operation: [:JMP, [:CONST, :hello]]}]

但我希望每个操作都有行号,以便在代码中进一步获得有意义的错误。

所以我将解析器更改为:

Nonterminals
code statement operation value.

Terminals
label identifer integer ',' opcode register address address_in_register line_number.

Rootsymbol code.

code -> line_number statement      : [{get_line('$1'), '$2'}].
code -> line_number statement code : [{get_line('$1'), '$2'} | '$3'].
%code -> statement      : ['$1'].
%code -> statement code : ['$1' | '$2'].

statement -> label     : #{'label' => label('$1')}.
statement -> operation : #{'operation' => '$1'}.

operation -> opcode value ',' value : [operation('$1'), '$2', '$4'].
operation -> opcode value           : [operation('$1'), '$2'].
operation -> opcode identifer       : [operation('$1'), value('$2')].
operation -> opcode                 : [operation('$1')].

value -> integer  : value('$1').
value -> register : value('$1').
value -> address  : value('$1').
value -> address_in_register : value('$1').

Erlang code.
get_line({_, Line, _})                 -> Line.

operation({opcode, _, OpcodeName})     -> OpcodeName.

label({label, _, Value})               -> Value.

value({identifer, _, Value})           -> ['CONST', Value];
value({integer, _, Value})             -> ['CONST', Value];
value({register, _, Value})            -> ['REGISTER', Value];
value({address, _, Value})             -> ['ADDRESS', Value];
value({address_in_register, _, Value}) -> ['ADDRESS_IN_REGISTER', Value].

(评论code是旧的,工作规则)

现在我来了 {:error, {1, :assembler_parser, ['syntax error before: ', ['\'MOV\'']]}}

提供相同的输入后。如何解决这个问题?

1 个答案:

答案 0 :(得分:1)

我的建议是将代号保留在代币中,而不是单独的代币,然后更改构建操作的方式。

所以我会建议:

operation -> opcode value ',' value : [operation('$1'), line('$1'), '$2', '$4'].
operation -> opcode value           : [operation('$1'), line('$1'), '$2'].
operation -> opcode identifer       : [operation('$1'), line('$1'), value('$2')].
operation -> opcode                 : [operation('$1'), line('$1')].

line({_, Line, _}) -> Line.

如果您想镜像Elixir AST,即使这样:

operation -> opcode value ',' value : {operation('$1'), meta('$1'), ['$2', '$4']}.
operation -> opcode value           : {operation('$1'), meta('$1'), ['$2']}.
operation -> opcode identifer       : {operation('$1'), meta('$1'), [value('$2')]}.
operation -> opcode                 : {operation('$1'), meta('$1'), []}.

meta({_, Line, _}) -> [{line, Line}].