Question

考虑一组13名丹麦人，11名日本人和8名波兰人。众所周知，将这组人分组的不同方式的数量是13 + 11 + 8 = 32：贝尔数（设置分区的数量）。但是，我们被要求在给定约束下找到可能的集合分区的数量。问题如下：

如果设置分区没有由至少两个仅包含单一国籍的人组成的群组，则称其为良好。这个集合有多少好分区？（一组可能只包括一个人。）

蛮力方法需要经过大约10 ^ 26个分区并检查哪些分区是好的。这似乎是不可行的，特别是如果团体较大或一个人介绍其他国籍。是否有一种聪明的方式呢？

编辑：作为旁注。一个非常好的解决方案可能没有希望。一个备受尊敬的组合学专家answered一个相关的问题，我认为，基本上说相关问题，因此这个问题也很难准确解决。

Answer 1

这是使用动态编程的解决方案。

它从空集开始，然后一次添加一个元素并计算所有有效分区。

状态空间是巨大的，但请注意，为了能够计算下一步，我们只需要了解以下内容：

对于每个国籍，它包含多少套，只包含该国籍的一个成员。（例如：{a}）
混合元素包含多少套。（例如：{a，b，c}）

对于这些配置中的每一个，我只存储总计数。例如：

[0, 1, 2, 2] -> 3
{a}{b}{c}{mixed} 
   e.g.: 3 partitions that look like: {b}, {c}, {c}, {a,c}, {b,c}

这是python中的代码：

import collections
from operator import mul
from fractions import Fraction

def nCk(n,k):
  return int( reduce(mul, (Fraction(n-i, i+1) for i in range(k)), 1) )

def good_partitions(l):
    n = len(l)
    i = 0
    prev = collections.defaultdict(int)
    while l:
        #any more from this kind?
        if l[0] == 0:
            l.pop(0)
            i += 1
            continue
        l[0] -= 1
        curr = collections.defaultdict(int)

        for solution,total in prev.iteritems():
            for idx,item in enumerate(solution):
                my_solution = list(solution)
                if idx == i:
                    # add element as a new set
                    my_solution[i] += 1
                    curr[tuple(my_solution)] += total
                elif my_solution[idx]:
                    if idx != n:
                        # add to a set consisting of one element
                        # or merge into multiple sets that consist of one element
                        cnt = my_solution[idx]
                        c = cnt
                        while c > 0:
                            my_solution = list(solution)
                            my_solution[n] += 1
                            my_solution[idx] -= c
                            curr[tuple(my_solution)] += total * nCk(cnt, c)
                            c -= 1
                    else:
                        # add to a mixed set
                        cnt = my_solution[idx]
                        curr[tuple(my_solution)] += total * cnt

        if not prev:
            # one set with one element
            lone = [0] * (n+1)
            lone[i] = 1
            curr[tuple(lone)] = 1

        prev = curr
    return sum(prev.values())

print good_partitions([1, 1, 1, 1])      # 15
print good_partitions([1, 1, 1, 1, 1])   # 52
print good_partitions([2, 1])            # 4
print good_partitions([13, 11, 8])       # 29811734589499214658370837

它为测试用例生成正确的值。我还针对蛮力解决方案（对于小值）进行了测试，并产生了相同的结果。

Answer 2

精确的分析解决方案很难，但多项式时间+空间动态编程解决方案很简单。

首先，我们需要一个关于组大小的绝对顺序。我们通过比较我们有多少丹麦人，日本人和波兰人来做到这一点。

接下来，写入的功能就是这个。

m is the maximum group size we can emit

p is the number of people of each nationality that we have left to split

max_good_partitions_of_maximum_size(m, p) is the number of "good partitions"
we can form from p people, with no group being larger than m

显然你可以把它写成一个有点复杂的递归函数，它总是选择要使用的下一个分区，然后用它作为新的最大大小调用自己，并从p中减去分区。如果您有此功能，那么您的回答只是max_good_partitions_of_maximum_size(p, p) p = [13, 11, 8]。但这将是一场在合理的时间内无法进行的强力搜索。

最后通过缓存对此函数的每次调用来应用https://en.wikipedia.org/wiki/Memoization，它将在多项式时间内运行。但是，您还必须缓存多项式调用次数。

具有给定约束的分区数

2 个答案: