根据可能的配对创建组合答案

【问题标题】：Creating combinations based on possible pairs根据可能的配对创建组合
【发布时间】：2021-10-08 07:04:03
【问题描述】：

我有n 索引导致n(n-1)/2 成对组合，例如为n=3

(i,j,k) -> (i,j), (i,k), (j,k)

现在对于每一对我都知道可能性，例如

(i,j) = (1,2), (1,3), (2,2)
(i,k) = (2,2), (1,2), (2,4)
(j,k) = (1,2), (4,3), (2,2)

换句话说，在(i,j,k) 的某种组合中，我们必须让(i,j) 是(1,2) 或(1,3) 或(2,2)，其他对也一样。我希望构造所有可能的组合，所以在上面的例子中只有两种可能的组合：

(i,j,k) = (2,2,2)
(i,j,k) = (1,2,2)

我目前已经实现了这个过程如下：

import numpy as np

ij = np.array(([1,2], [1,3], [2,2]))
ik = np.array(([2,2], [1,2], [2,4]))
jk = np.array(([1,2], [4,3], [2,2]))

possibilities = []

possible_i = np.union1d(ij[:,0], ik[:,0])
possible_j = np.union1d(ij[:,1], jk[:,0])
possible_k = np.union1d(ik[:,1], jk[:,1])

for i in possible_i:
    for j in possible_j:

        if ([i,j] == ij).all(1).any():     
            for k in possible_k:
                if (([i,k] == ik).all(1).any() and
                    ([j,k] == jk).all(1).any()):
                    print(i,j,k)

虽然这可行并且可以很容易地适应任何n，但它对我来说似乎不是很有效，例如它会检查组合：

当然，我们知道在检查(i,j,k) = (i,2,3) 无效后，我们不必重新检查此表单的其他组合。有没有更有效的方法来解决这个任务（这也适用于更高的n）？

【问题讨论】：

您是否已经查看了itertools 模块中提供的itertools.pairwise()、itertools.product()、itertools.combinations() 或itertools.permutations()？
@albert 是的！但到目前为止没有成功..
print(list(itertools.combinations(('i', 'j', 'k'), 2))) 输出 [('i', 'j'), ('i', 'k'), ('j', 'k')] 这与您的介绍性示例相同。这有什么问题？
@albert 我不确定你在暗示什么，但这绝不是我问题的答案。我正在尝试在 itertools 不支持的约束下找到组合
抱歉，我似乎不理解您的限制。

标签： python list set combinations

【解决方案1】：

问题可以用图表来描述，其中节点按列组织：

一个节点由它所在的列和它的值来唯一标识。

可以通过获取前两列之间可能的边，然后在可能的情况下将这些边扩展到大小为 2 的路径，涉及相关列的节点，然后再次扩展到大小为 3，涉及下一个列，...等。每次将节点添加到路径时，都必须验证路径中的所有先前节点都连接到该新节点。

为了有效地做到这一点，我建议使用邻接列表类型的数据结构，或者实际上是邻接set，这样您就可以从给定的列中快速获取可以访问的节点另一列中的节点。这些邻居集可以相交，以便留下满足所有约束的连接。

我会将输入约束定义为字典，因此对于给定的对列表，i 和 j 是什么（列）毫无疑问。所以示例输入将是这个字典：

{
    (0, 1): [(1,2), (1,3), (2,2)],
    (0, 2): [(2,2), (1,2), (2,4)],
    (1, 2): [(1,2), (4,3), (2,2)]
}

代码：

from collections import defaultdict

def solve(constraints):
    # n is the size of each output tuple 
    n = max(b for _, b in constraints) + 1

    # convert contraints to adjacency sets
    graph = {}
    for key, pairs in constraints.items():
        dct = defaultdict(set)
        for a, b in pairs:
            dct[a].add(b)
        graph[key] = dct

    paths = constraints[(0, 1)]
    for j in range(2, n):
        newpaths = []
        for path in paths:
            additions = graph[(0, j)][path[0]]
            for i in range(1, len(path)):
                additions &= graph[(i, j)][path[i]]
                if not additions:  # quick exit
                    break
            newpaths.extend((*path, num) for num in additions)
        paths = newpaths

    return paths

这样调用：

constraints = {
    (0, 1): [(1,2), (1,3), (2,2)],
    (0, 2): [(2,2), (1,2), (2,4)],
    (1, 2): [(1,2), (4,3), (2,2)]
}
result = solve(constraints)
print(result)

输出：

[(1, 2, 2), (2, 2, 2)]

【讨论】：

【解决方案2】：

我们用长度为n 的列表来表示可能的组合。我们尚未获得任何信息的索引将包含None。

每一轮都会处理一对索引，并检查这对的所有规则。

如果该对的第一个值存在于前一回合的可能组合中，而第二个从未被触及（因此是无），我们将其添加为本回合的新可能组合。

如果两个值都存在于先前的组合中，这确认它可能是有效的，我们也将其添加。

我们可以放弃上一回合的结果，因为我们之前认为可能但在本回合尚未验证的组合是不可能的。

所以，代码：

from itertools import combinations

def possible_combs(n, pairs_list):
    # Pairs of indices, generated in the same order as the lists of allowed pairs
    indices = combinations(range(n), r=2)
    # Current list of possible combinations. None means no information for this index
    current = [[None] * n]

    for (first, last), allowed in zip(indices, pairs_list):
        previous = current
        current = []
        # Iteration on each allowed pair for the current pair of indices
        for i, j in allowed:
            for comb in previous:
                if comb[first] is None:
                    # We can have previous combinations having None for the starting index 
                    # only during the first step. In this case, we create the path. 
                    new = comb[:]
                    new[first] = i
                    new[last] = j
                    current.append(new)
                if comb[first] == i:
                    if comb[last] is None:
                        # A path leading to a yet unknown value, we add it
                        new = comb[:]
                        new[last] = j
                        current.append(new)
                    elif comb[last] == j:
                        # A valid path, we keep it
                        current.append(comb[:])
                    # At this point, any previous combination that didn't satisfy 
                    # any rule of this turn hasn't made it
                    # to current and will be forgotten...
    return current

在您的数据上运行示例：

possible_combs(3, [[(1,2), (1,3), (2,2)],
                    [(2,2), (1,2), (2,4)],
                    [(1,2), (4,3), (2,2)]])

输出：

[[2, 2, 2], [1, 2, 2]]

请注意，它不对每对索引的规则数量做出假设。

【讨论】：

【解决方案3】：

您可以在跟踪先前包含的与输入中的元组对关联的值时使用递归。这样，可以更有效地预先生成组合，而无需事后过滤整组组合：

from collections import defaultdict
d, d1 = {(0, 1): [(1, 2), (1, 3), (2, 2)], (0, 2): [(2, 2), (1, 2), (2, 4)], (1, 2): [(1, 2), (4, 3), (2, 2)]}, defaultdict(list)
for (a, b), j in d.items():
   d1[a].extend((l:=list(zip(*j)))[0])
   d1[b].extend(l[1])

def combos(p, c = {}):
   if not p:
      yield [*c.values()]
   else:
      for i in set(d1[p[0]]):
         if len(c) < 1 or all((b, i) in d[(a, p[0])] for a, b in c.items()):
            yield from combos(p[1:], {**c, p[0]:i})

print(list(combos([*{i for k in d for i in k}])))

输出：

[[1, 2, 2], [2, 2, 2]]

【讨论】：