Question

我正在运行以下程序：

#include <iostream>
#include <vector>
#include <cmath>
#include <cstdlib>
#include <chrono>
using namespace std;

const int N = 200;          // Number of tests.
const int M = 2000000;      // Number of pseudo-random values generated per test.
const int VALS = 2;         // Number of possible values (values from 0 to VALS-1).
const int ESP = M / VALS;   // Expected number of appearances of each value per test.

int main() {
    for (int i = 0; i < N; ++i) {
        unsigned seed = chrono::system_clock::now().time_since_epoch().count();
        srand(seed);
        vector<int> hist(VALS, 0);
        for (int j = 0; j < M; ++j) ++hist[rand() % VALS];
        int Y = 0;
        for (int j = 0; j < VALS; ++j) Y += abs(hist[j] - ESP);
        cout << Y << endl;
    }
}

该程序执行N次测试。在每个测试中，我们生成介于0和VALS-1之间的M个数字，同时我们在直方图中计算它们的外观。最后，我们在Y中累积误差，这些误差对应于直方图的每个值与期望值之间的差异。由于数字是随机生成的，因此每个测试中每个数字理想情况下都会出现M / VALS次数。

在运行我的程序后，我分析了结果数据（即Y的200个值），我意识到发生了一些我无法解释的事情。我看到，如果用vc ++编译程序并给出一些N和VALS（在这种情况下N = 200和VALS = 2），我们会得到不同M值的不同数据模式。对于某些测试，结果数据遵循正常分配，对于一些测试，它没有。此外，这种类型的结果似乎是交替的M（每个测试中产生的伪随机数的数量）增加：