Question

假设我们要在编译时构建一个平凡的表

template<int N, int M>
constexpr auto foo()
{
    std::array<std::array<int, N>, M> a = {};
    for(int m = 1; m < M; m++)
        for(int n = 1; n < N; n++)
        {
            // For exposition only
            auto x = (m ^ 42) + (n << 3) - m;
            auto y = (n ^ 420) + (m % 420);
            a[m][n] = (a[(x + m) % m][(y + n) % n] + (x ^ y)) % 0xFACADE;
        }
    return a;
}

constexpr auto bar(int n, int m)
{
    constexpr auto dim = /* something */;
    constexpr auto table = foo<dim, dim>();
    return table[n][m];
}

It doesn't take much将编译时间推高。另一种选择是通过脚本将表生成为源代码，显然要好得多。

如何减少此类函数的编译时间？

一些动机

constexpr函数与常规函数有很大的不同，与常规函数相比，它的极慢性体现了这一点。除了由编译器执行之外，它们还具有边界检查，溢出检查以及几乎所有阻止阻止的UB检查。我怀疑这会使从常规函数中收集的大多数直觉变得毫无用处。

Answer 1

我想编译器只是优化了以下几点，所以您可能没有任何优势，但是：

1）m ^ 42和m % 420不依赖于n，因此您可以在内部循环之外进行计算

2）如果我没看错，

(x + m) % m  ==  x % m + m % m
             ==  x % m + 0
             ==  x % m

和

(y + n) % n  ==  y % n + n % n
             ==  y % n + 0
             ==  y % n

3）您可以尝试向const变量中添加一些auto。

因此您可以尝试

template <int N, int M>
constexpr auto foo ()
{
    std::array<std::array<int, N>, M> a = {};

    for(int m = 1; m < M; m++)
    {
        auto const m42 = m ^ 42;
        auto const m420 = m % 420;

        for(int n = 1; n < N; n++)
        {
            // For exposition only
            auto const x = m42 + (n << 3) - m;
            auto const y = (n ^ 420) + m420;
            a[m][n] = (a[x % m][y % n] + (x ^ y)) % 0xFACADE;
        }
    }
    return a;
}

如果这可行，您可以尝试在x % m上进行工作，将x的不依赖于n（m42 - m）和依赖项（{ {1}}），因此您可以在内循环之外计算n << 3的一部分。

如何减少constexpr函数的编译时间？

一些动机

1 个答案: