为什么valgrind报告glibc tsearch()随机泄漏内存?

时间:2018-10-10 00:58:15

标签: c valgrind glibc

我正在使用glibc tsearch() API系列在示例程序中存储动态分配的数据块。

我发现,当我使用tsearch()向树中添加几个malloc() ed块时,valgrind将其中一些块报告为“可能丢失”。尽管“可能丢失”与“绝对丢失”并不完全相同,但是previous SO advice通常是调查造成这些情况的原因。

我的示例程序如下:

#include <stdio.h>
#include <search.h>
#include <stdlib.h>
#include <signal.h>

struct data {
    int id;
    char *str;
};

static int
compare (const void *a, const void *b) {
    const struct data *data_a = a, *data_b = b;

    if (data_a->id < data_b->id) {
        return -1;
    } else if (data_a->id > data_b->id) {
        return 1;
    } else {
        return 0;
    }
}

int main (int argc, char **argv) {
    void *tree = NULL;
    struct data *d1, *d2, *d3, *d4;

    d1 = malloc(sizeof(struct data));
    d1->id = 10;
    d1->str = "Hello";
    tsearch(d1, &tree, compare);

    d2 = malloc(sizeof(struct data));
    d2->id = 30;
    d2->str = "Goodbye";
    tsearch(d2, &tree, compare);

    d3 = malloc(sizeof(struct data));
    d3->id = 20;
    d3->str = "Thanks";
    tsearch(d3, &tree, compare);

    d4 = malloc(sizeof(struct data));
    d4->id = 40;
    d4->str = "OK";
    tsearch(d4, &tree, compare);

    raise(SIGINT);

    return 0;
}

请注意,我要在raise(SIGINT)的末尾调用main(),以便valgrind仍然能够在隐式free() d之前分析已分配的块。

我正在valgrind下编译并运行,如下所示:

$ gcc ts.c -o ts
$ valgrind --leak-check=full ./ts
==2091== Memcheck, a memory error detector
==2091== Copyright (C) 2002-2017, and GNU GPL'd, by Julian Seward et al.
==2091== Using Valgrind-3.13.0 and LibVEX; rerun with -h for copyright info
==2091== Command: ./ts
==2091== 
==2091== 
==2091== Process terminating with default action of signal 2 (SIGINT)
==2091==    at 0x4E7AE97: raise (raise.c:51)
==2091==    by 0x1088CE: main (in /home/ubuntu/ts)
==2091== 
==2091== HEAP SUMMARY:
==2091==     in use at exit: 160 bytes in 8 blocks
==2091==   total heap usage: 8 allocs, 0 frees, 160 bytes allocated
==2091== 
==2091== 24 bytes in 1 blocks are possibly lost in loss record 8 of 8
==2091==    at 0x4C2FB0F: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
==2091==    by 0x4F59483: tsearch (tsearch.c:338)
==2091==    by 0x108801: main (in /home/ubuntu/ts)
==2091== 
==2091== LEAK SUMMARY:
==2091==    definitely lost: 0 bytes in 0 blocks
==2091==    indirectly lost: 0 bytes in 0 blocks
==2091==      possibly lost: 24 bytes in 1 blocks
==2091==    still reachable: 136 bytes in 7 blocks
==2091==         suppressed: 0 bytes in 0 blocks
==2091== Reachable blocks (those to which a pointer was found) are not shown.
==2091== To see them, rerun with: --leak-check=full --show-leak-kinds=all
==2091== 
==2091== For counts of detected and suppressed errors, rerun with: -v
==2091== ERROR SUMMARY: 1 errors from 1 contexts (suppressed: 0 from 0)

$ 

Valgrind报告一个24字节块丢失。如果我将raise(SIGINT)移到d4分配和树添加之前,则不会报告丢失任何块。

为什么在添加4个块时丢失了一个块,即使在添加3个块时都没有丢失呢?

1 个答案:

答案 0 :(得分:4)

事实证明,glibc tsearch()的实现有点顽皮,可以将指针中的低序位扭动到存储在二进制搜索树中的块。它使用指针中的低位存储标志: https://code.woboq.org/userspace/glibc/misc/tsearch.c.html#341

特别是,该实现使用这些宏来设置或清除低阶指针位,以分别将块标记为红色或黑色:

#define SETRED(N) (N)->left_node |= ((uintptr_t) 0x1)
#define SETBLACK(N) (N)->left_node &= ~((uintptr_t) 0x1)

有效地增加了指针,这使valgrind认为这些有效指针(存储在tsearch()树中)已经overwritten and therefore possibly lost。但是,这些不是丢失的块-它们已成功地以低阶位为模存储在二进制搜索树中。 tsearch()将在访问树时对这些位进行必要的屏蔽。 tsearch()之所以可以这样做,是因为malloc()个块通常与至少偶数地址对齐。

只有在二进制树中标记为“红色”节点的块才设置了该位,因此其模式是否“可能丢失”完全取决于实现将块归类为“红色”或“红色”的方式。黑色,添加,删除和重新平衡操作期间。

因此tsearch()的纠结使valgrind错误地认为这些块已丢失。在这种情况下,valgrind报告的是误报。