我的扩展使用Ruby库通过接口发送和接收命令和遥测数据包,并将它们提供给C文件传输库进行处理。
收到大量数据包后,我会得到这个段错误。这是GDB的回溯。
#0 search_method (defined_class_ptr=<synthetic pointer>, id=6177, klass=<optimized out>) at vm_method.c:567
#1 rb_method_entry_get_without_cache (klass=19934160, id=6177, defined_class_ptr=0x12c5500) at vm_method.c:592
#2 0x00007ffff7aced90 in rb_method_entry (klass=klass@entry=19934160, id=<optimized out>,
defined_class_ptr=defined_class_ptr@entry=0x12c5500) at vm_method.c:663
#3 0x00007ffff7acee71 in vm_search_method (ci=ci@entry=0x12c54c0, recv=<optimized out>) at vm_insnhelper.c:842
#4 0x00007ffff7ad4b3d in vm_exec_core (th=th@entry=0x6035d0, initial=initial@entry=0) at insns.def:1068
#5 0x00007ffff7ad951b in vm_exec (th=th@entry=0x6035d0) at vm.c:1440
#6 0x00007ffff7adac91 in vm_call0_body (th=th@entry=0x6035d0, ci=ci@entry=0x7fffffffbcb0, argv=0x7fffffffbd60)
at vm_eval.c:180
#7 0x00007ffff7adb6c6 in vm_call0 (defined_class=<optimized out>, me=<optimized out>, argv=<optimized out>,
argc=1, id=14481, recv=22956880, th=0x6035d0) at vm_eval.c:59
#8 rb_call0 (recv=recv@entry=22956880, mid=14481, argc=argc@entry=1, argv=argv@entry=0x7fffffffbd60,
scope=scope@entry=CALL_FCALL, self=<optimized out>) at vm_eval.c:349
#9 0x00007ffff7adbc64 in rb_call (scope=CALL_FCALL, argv=0x7fffffffbd60, argc=1, mid=<optimized out>,
recv=22956880) at vm_eval.c:616
#10 rb_funcall (recv=recv@entry=22956880, mid=<optimized out>, n=n@entry=1) at vm_eval.c:818
#11 0x00007ffff6092b10 in DL_RecvPdu () at CFGroundClient.c:414
#12 0x00007ffff6092dbe in method_run (self=<optimized out>) at CFGroundClient.c:302
#13 0x00007ffff7ace272 in vm_call_cfunc_with_frame (ci=0x0, reg_cfp=0x7ffff7fd1f70, th=0x6035d0)
at vm_insnhelper.c:1380
#14 vm_call_cfunc (th=th@entry=0x6035d0, reg_cfp=reg_cfp@entry=0x7ffff7fd1f70, ci=ci@entry=0xa29090)
at vm_insnhelper.c:1473
#15 0x00007ffff7ae0f6e in vm_call_method (th=0x6035d0, cfp=0x7ffff7fd1f70, ci=0xa29090) at vm_insnhelper.c:1689
#16 0x00007ffff7ad4b4b in vm_exec_core (th=th@entry=0x6035d0, initial=initial@entry=0) at insns.def:1069
#17 0x00007ffff7ad951b in vm_exec (th=th@entry=0x6035d0) at vm.c:1440
#18 0x00007ffff7adaa73 in rb_iseq_eval_main (iseqval=iseqval@entry=10626960) at vm.c:1685
#19 0x00007ffff798707d in ruby_exec_internal (n=0xa22790) at eval.c:254
#20 0x00007ffff798948d in ruby_exec_node (n=n@entry=0xa22790) at eval.c:319
#21 0x00007ffff798bc3e in ruby_run_node (n=0xa22790) at eval.c:311
#22 0x000000000040087b in main (argc=3, argv=0x7fffffffde58) at main.c:36
以下是相关代码:
void DL_RecvPdu() {
.....
/* Segfault coming from here
CUR_PACKET.read("PDU_DATA");
Reads telemetry item PDU_DATA from packet CUR_PACKET
and returns results. PDU data is an array of bytes
represented as a Ruby string that can contain nulls */
VALUE pdu_data_block = rb_funcall(rb_CUR_PACKET,
rb_READ, 1,
rb_PDU_DATA
);
....
}
有谁知道这个错误可能来自哪里以及我如何解决它?
答案 0 :(得分:0)
这个问题有很多可能的原因,而不是(似乎)足够的数据。
一种解释可能是Ruby对象(即rb_CUR_PACKET
,rb_READ
或rb_PDU_DATA
)在您调用函数时无效,因为它们可能是由GC(垃圾收集器)。
如果Ruby对象未引用对象,则它们的内存可能会比您预期的更快释放。
一个简单的解决方案是将对象注册为全局对象(它们永远不会被释放)或将它们作为变量分配到用户的对象中。
另一个可能的原因是您没有在GIL(全局指令锁定)中执行代码。
Ruby MRI代码必须始终在GIL内和Ruby线程内运行。如果你离开了GIL或者生成了不是Ruby线程的新线程,那么如果你很幸运,你的代码就会崩溃。
我怀疑GC,但这是一个折腾。
祝你好运!