Question

在gcc doc的链接gcc-doc中，我看到gcc版本4.1.1具有原子内建函数。

在我的项目中，我们在centos5.0中使用gcc 4.1.1，然后在编译后，我们的项目在centos5.0之后运行良好。这是我的测试代码。

#include <string.h>
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <pthread.h>
#include <stdbool.h>

static int count = 0;

void *test_func(void *arg)
{
    int i=0;
    int result = 0;
    for(i=0;i<2000;++i)
    {
        __sync_add_and_fetch(&count,1);
    }

    return NULL;
}

int main(int argc, const char *argv[])
{
    pthread_t id[10];
    int i = 0;

    for(i=0;i<10;++i){
        pthread_create(&id[i],NULL,test_func,NULL);
    }

    for(i=0;i<10;++i){
        pthread_join(id[i],NULL);
    }
    //10*2000=20000
    printf("%u\n",count);

    return 0; 
}

当我将功能更改为以下内容时，

void *test_func(void *arg)
{
    int i=0;
    int result = 0;
    for(i=0;i<2000;++i)
    {
       result = __sync_add_and_fetch(&count,1);
    }

    return NULL;
}

gcc -g -o code code.c  -lpthread

然后编译出错：对`__sync_add_and_fetch_4'的未定义引用，我不知道为什么这是错误的。

现在我要使用功能：__ sync_val_compare_and_swap，这是stackoverflow的演示

#include <stdio.h>
#include <inttypes.h>
#include <pthread.h>
#include <stdbool.h>
#include <assert.h>

volatile bool lock;
void *locktest( void *arg )
{
    int i = 0;
    for ( i = 0 ; i < 100000 ; ++i )
    {
        // acquire a lock
        while( __sync_val_compare_and_swap( &lock, false, true ) == true )
        {
            // Spin while we don't acquire
        }

        // make sure we have the lock
        assert( lock == true );

        // release the lock
        assert( __sync_val_compare_and_swap( &lock, true, false ) == true );
    }
}

int main (void)
{
    return 0;
}

这是错误的编译： /root/library/demo/sync_val_compare.c:14：未定义对__sync_val_compare_and_swap_1' /root/library/demo/sync_val_compare.c:23: undefined reference to __ sync_val_compare_and_swap_1'的引用

我不知道为什么？我理解错了吗？

Answer 1

如果目标体系结构上不支持特定的__sync函数，则GCC会将其视为外部函数（如果要提供实现）。如果这样做，它将追加元素大小；这就是4的来源。您选择的目标体系结构似乎不支持4字节类型的原子add_and_fetch（这使我怀疑它根本不支持原子内在函数）。使用-march编译器选项来强制使用特定的体系结构可能会有所帮助；尝试-march=native，看看是否足够。

顺便说一句，对于支持它们的GCC版本，您应该使用the __atomic intrinsics而不是__sync内部函数。 __atomic内在函数可让您更好地控制内存顺序保证，从而有可能提高性能。（不过，当您尝试在不支持它们的体系结构上使用它们时，两组内部函数都出现相同的问题。）

gcc AtomicBuiltins在gcc 4.1.1中有些奇怪

1 个答案: