Question

我写了一个小程序，在Callgrind对它进行动态检测之前，该程序运行良好。

$ g++ -std=c++11 -pthread -g -ggdb -o program.exe program.cpp
$ time valgrind --tool=callgrind ./program.exe

代码：

#include <atomic>
#include <thread>
#include <iostream>

constexpr int CST_TARGET = 10*1000;

std::atomic<bool> g_lock = {false};
std::atomic<bool> g_got_work = {true};
int g_passer = 0;
long long g_total = 0;

void producer() {
    while (1) {
        while (g_lock.load(std::memory_order_seq_cst));
        if (g_passer >= CST_TARGET) {
            g_got_work.store(false, std::memory_order_seq_cst);
            return;
        }
        ++g_passer;
        g_lock.store(true, std::memory_order_seq_cst);
    }
}

void consumer() {
    while (g_got_work.load(std::memory_order_seq_cst)) {
        if (g_lock.load(std::memory_order_seq_cst)) {
            g_total += g_passer;
            g_lock.store(false, std::memory_order_seq_cst);
        }
    }
}

int main() {
    std::atomic<int> val(0);
    std::thread t1(producer);
    std::thread t2(consumer);
    t1.join();
    t2.join();
    std::cout << "g_passer = " << g_passer << std::endl;
    std::cout << "g_total = " << g_total << std::endl;
    return 0;
}

检测将在10分钟后结束，因此我终止了检测，并查看了KCachegrind的统计信息。 std::atomic<bool>::load(...)的呼叫量达数亿至数十亿。

有什么想法可以使Callgrind的哪些部分改变原子调用的行为并使它们失败？该程序本身无需毫秒即可运行毫秒。

Answer 1

使用--fair-sched = yes应该可以解决问题。

为什么Callgrind使原子负载永无止境

1 个答案: