Question

我正在尝试从头开始在C ++中实现循环屏障。目的是实现与Java实现尽可能一致的方法。班级参考在这里。 https://docs.oracle.com/javase/7/docs/api/java/util/concurrent/CyclicBarrier.html

现在在我的测试中，returnStatus应该用于成功触发屏障的每个线程，其值的范围是barrierLimit-1到零。我正在尝试使用原子变量和内存围栏来实现这一目标。但是我的代码无法通过测试，在某些情况下，两个线程的returnStatus值相同。

请有人建议是否有任何技术可以帮助解决此问题。我想在不使用锁的情况下解决此问题，以便我可以尽可能地真正应用无锁行为。

完整的代码参考位于https://github.com/anandkulkarnisg/CyclicBarrier/blob/master/CyclicBarrier.cpp

Sample test case result is below [ buggy case ]:

I am currently in thread id = 140578053969664.My barrier state count is = 4
I am currently in thread id = 140577877722880.My barrier state count is = 2
I am currently in thread id = 140577550407424.My barrier state count is = 1
I am currently in thread id = 140577936471808.My barrier state count is = 2
I am currently in thread id = 140577760225024.My barrier state count is = 0


The code snippet is below.

        // First check and ensure that the barrier is in good / broken state.
        if(!m_barrierState && !m_tripStatus)
        {
            // First check the status of the variable and immediately exit throwing exception if the count is zero.
            int returnResult;
            if(m_count == 0)
                throw std::string("The barrier has already tripped. Pleas reset the barrier before use again!!" + std::to_string(returnResult));

            // First ensure that the current wait gets the waiting result assigned immediately.

            std::atomic_thread_fence(std::memory_order_acquire);
            m_count.fetch_sub(1, std::memory_order_seq_cst);
            returnResult = m_count.load();
    std::atomic_thread_fence(std::memory_order_release);

Answer 1

std::atomic_thread_fence(std::memory_order_acquire);
m_count.fetch_sub(1, std::memory_order_seq_cst);      // [1]
returnResult = m_count.load();                        // [2]
std::atomic_thread_fence(std::memory_order_release);

[2]多个线程正在同时执行此步骤。 std::atomic_thread_fence不会阻止其他线程同时运行相同的代码。这样，两个线程才能获得相同的值。

相反，在标记为fetch_sub的行上捕获[1]的返回值

returnResult = m_count.fetch_sub(1, std::memory_order_seq_cst) - 1;

顺便说一句，我敢肯定你在这里不需要栅栏。（我真的不能不看更多功能就知道。）如果这样做，您可能只是将returnResult切换为原子。

您似乎正在使用隔离栅，就好像它们是事务性内存一样。他们不是。该发行版实质上控制了由使用获取的任何CPU感知到的存储顺序保证。只要不违反顺序保证，就可以在实际处理发行版之前自由传播写入。作为一个思想实验，设想执行[1]，然后进行上下文切换，经过一百万年，然后执行[2]。现在假设m_count具有与一百万年前一样的价值显然是荒谬的。该发行版可能会刷新写缓冲区，但更改可能已被刷新。

最后，如果您将seq_cst和acquire / release语义混合使用，可能会发生奇怪的事情。抱歉，这很含糊，但我对它的理解还不够，无法解释。

使用原子变量，CyclicBarrier中的C ++倒计时出错[请提供无锁的解决方案]

1 个答案: