numpy矢量化:查找列表与列表之间的交集

时间:2019-05-31 11:17:57

标签: python numpy vectorization

我试图找到一个列表和一个列表之间的交集。这可以通过简单的for循环轻松解决:

def find_intersec(x,y):
    result = []

    for i in range(len(y)):
        if set(x).intersection(set(y[i])):
            result.append(y[i])

    return(result)

x = [1,2,3,4,5,6]
y = [[1,2,3], [4,5,6], [9,10,11]]



find_intersec(x,y)

如何将以上内容更改为Numpy向量化解决方案?我尝试了numpy.intersect1d(),但没有成功。

2 个答案:

答案 0 :(得分:1)

You can have a function like this:

import numpy as np

def find_intersec_vec(x, y):
    y_all = np.concatenate(y)
    y_all_in = np.isin(y_all, x)
    splits = np.cumsum([0] + [len(lst) for lst in y])
    y_in = np.logical_or.reduceat(y_all_in, splits[:-1])
    return [lst for lst, isin in zip(y, y_in) if isin]

Test:

x = [1, 2, 3, 4, 5, 6]
y = [[1, 2, 3], [4, 5], [6, 7], [8, 9, 10, 11]]
print(find_intersec(x, y))
# [[1, 2, 3], [4, 5], [6, 7]]
print(find_intersec_vec(x, y))
# [[1, 2, 3], [4, 5], [6, 7]]

答案 1 :(得分:1)

As you mentioned, numpy.intersect1d() can be used:

import numpy as np

x = [1,2,3,4,5,6]
y = [[1,2,3], [4,5,6], [9,10,11]]

intersec = [np.intersect1d(i, x) for i in y if len(np.intersect1d(i, x)) > 0]

result:

[array([1, 2, 3]), array([4, 5, 6])]