Question

如何在1,000到20,000之间生成100个随机数，C ++中的平均值为9,000？我正在研究C ++ 11库，但我找不到允许我包含平均值和范围的方法。

Answer 1

由于您不关心分布，只要它满足您的约束，到目前为止最简单的方法就是始终只生成9000。最简单的分布就是生成1000概率p和20000概率为1-p的东西，其中你已经解决了{{1}的值这给出了正确的平均值。

我强烈怀疑在开始考虑编程之前，你应该弄清楚你要做的事情的数学/统计。

Answer 2

由于您在分发方面非常灵活，因此一个简单的解决方案仍能提供合理的结果，而不必采用拒绝逻辑，这是一个triangular distribution。即你将三角形的下端设置为1,000，三角形的上端设置为20,000，三角形的尖端设置为你想要的平均值为9,000。

上面的维基百科链接表明三角形分布的平均值为：

(a + b + c) / 3

其中a和b分别是您的下限和上限，c是三角形的一角。对于您的输入，简单代数表示c = 6,000将给出您想要的平均值9,000。

在C ++的<random>标题中有一个名为std::piecewise_linear_distribution的分布，非常适合设置三角分布。这只需要两条直线。构建这种三角形分布的一种简单方法是：

std::piecewise_linear_distribution<> dist({1000., 6000., 20000.},
                                          [](double x)
                                          {
                                              return x == 6000 ? 1. : 0.;
                                          });

现在您只需将URNG插入此分发中并生成结果即可。为了理智，根据您的问题陈述收集一些重要的统计信息也很有帮助，例如最小值，最大值和平均值。

这是一个完整的程序：

#include <algorithm>
#include <iostream>
#include <numeric>
#include <random>
#include <vector>

int
main()
{
    std::mt19937_64 eng;
    std::piecewise_linear_distribution<> dist({1000., 6000., 20000.},
                                              [](double x)
                                              {
                                                  return x == 6000 ? 1. : 0.;
                                              });
    std::vector<double> results;
    for (int i = 0; i < 100; ++i)
        results.push_back(dist(eng));
    auto avg = std::accumulate(results.begin(), results.end(), 0.) / results.size();
    auto minmax = std::minmax_element(results.begin(), results.end());
    std::cout << "size = " << results.size() << '\n';
    std::cout << "min = " << *minmax.first << '\n';
    std::cout << "avg = " << avg << '\n';
    std::cout << "max = " << *minmax.second << '\n';
}

此应可移植输出：

size = 100
min = 2353.05
avg = 8972.1
max = 18162.5

如果您将采样值的数量调高到足够高，您将看到参数收敛：

size = 10000000
min = 1003.08
avg = 8998.91
max = 19995.5

根据需要播种。

Answer 3

以下是如何使用正态分布执行此操作，使用样本丢弃技术将值保持在边界内。这当然会扭曲分布，因此不再正常。这个效果有多大取决于您对标准偏差的原始选择。对您而言重要的程度取决于您的申请。

#include <iostream>
#include <random>

double get_random_number_with_minimum_mean_and_maximum(
    double minimum,
    double mean,
    double maximum
) {

    // Any uniform random generator of your choice could be used here.
    // Obviously making this static is not rentrant or thread-safe.
    static std::mt19937 generator;

    // We'll start with a normal distribution with a standard deviation set
    // such that ~99.7% of the time we'll get a number within the average width
    // of your upper and lower bounds.
    //
    // Why ~99.7% and not some other number? Because that corresponds to three
    // standard deviations, and you didn't specify any requirements, so I'm
    // just making assumptions on your behalf.
    double const average_bound_width = ((mean-minimum) + (maximum-mean)) / 2;
    double const standard_deviation  = average_bound_width / 3;
    std::normal_distribution<double> distribution(mean, standard_deviation);

    // Now, keep fetching numbers until we find one that is within our desired
    // bounds. Throwing numbers away randomly from a normal distribution does
    // not affect the mean, but since our bounds are going to be skewed, our
    // mean will probably get skewed slightly, but this will likely not be
    // very noticable.
    double value;
    do {
        value = distribution(generator);
    } while (value < minimum || maximum < value);

    return value;
}

int main() {
    for (int i=0; i<100; ++i) {
        std::cout << get_random_number_with_minimum_mean_and_maximum(
            1000,
            9000,
            20000
        ) << '\n';
    }
}

当我运行它时，我得到了这个输出：

平均值为~8980，仅100个样本非常接近9000。

如果我用100万个样本运行，平均值是~9049。

Answer 4

这是一个连续的"power-shifted" distribution，可以返回范围内的任何值，并一次执行。平均值是所希望的，但模式通常在最接近平均值的两端。

double shiftable_distribution(double minimum, double mean, double maximum)
{
    double range = maximum - minimum;
    double scaledmean = (mean - minimum) / range;
    return pow((double)rand() / RAND_MAX, 
               (1.0 - scaledmean) / scaledmean) 
                * range + minimum;
}

分布实际上是什么样的（1000,9000,20000）

enter image description here

Answer 5

执行此操作的一种方法：如果均匀生成范围为[l，x]的随机值的0≤a≤1，并且生成的 1-a [x，h]范围内的随机值均匀，平均值为：

m =（（l + x）/ 2）* a +（（x + h）/ 2）*（1-a）

因此，如果您想要特定的 m ，则可以使用 a 和 x 。

例如，您可以设置 x = m ，因此 a =（h-m）/（h-l）

如何使用C ++中的均值生成随机数？

5 个答案: