Question

我正在编写一个程序，需要极低延迟的纹理来屏幕流（10ms以下），我已经使用GL_ARB_buffer_storage实现了这个功能，它非常适合流式传输，而vsync则可以防止撕裂。

但是我发现NVidia管道在阻塞之前调用交换缓冲区时会缓冲2到8帧，我需要阻止它。

我所做的是以下内容：

uint64_t detectPresentTime()
{
  // warm up first as the GPU driver may have multiple buffers
  for(int i = 0; i < 10; ++i)
    glxSwapBuffers(state.renderer);

  // time 10 iterations and compute the average
  const uint64_t start = microtime();
  for(int i = 0; i < 10; ++i)
    glxSwapBuffers(state.renderer);
  const uint64_t t = (microtime() - start) / 10; 

  // ensure all buffers are flushed
  glFinish();

  DEBUG_INFO("detected: %lu (%f Hz)", t, 1000000.0f / t); 
  return t;
}

然后在绘制线程中我执行以下操作：

uint64_t presentTime = detectPresentTime();
if (presentTime > 1000)
  presentTime -= 1000;

while(running)
{
  const uint64_t start = microtime();
  glClear();

  // copy the texture to the screen

  glxSwapBuffers();

  const uint64_t delta = microtime() - start;
  if (delta < presentTime)
  {
    glFlush();
    usleep(delta);
    glFinish();
  }
}

此解决方案在NVidia硬件上运行良好，但据报道无法在AMD GPU上计算正确的当前时间。

有更好的方法吗？我知道glFinish通常不应该在应用程序中使用，除了分析，但我找不到另一种方法来确保GPU管道不缓冲帧。

编辑：对于那些感兴趣的人来说，这有效地模拟了Linux下的FastSync，但没有禁用vsync。

Edit2：也许现在的时间功能应该有点不同了：

uint64_t detectPresentTime()
{
  glFinish();

  // time 10 iterations and compute the average
  const uint64_t start = microtime();
  for(int i = 0; i < 10; ++i)
  {
    glxSwapBuffers(state.renderer);
    glFinish();
  }
  const uint64_t t = (microtime() - start) / 10; 

  DEBUG_INFO("detected: %lu (%f Hz)", t, 1000000.0f / t); 
  return t;
}

Answer 1

我找到了答案，有一个鲜为人知的名为SGI_video_sync的OpenGL扩展，使用它可以等待下一帧。

即：

glFlush();
uint remainder;
glXWaitVideoSyncSGI(1, 0, &remainder);

防止OpenGL缓冲帧

1 个答案: