Question

我有一个包含值的2D数组和一个带有索引值的1D数组，在这里我想拆分2D矩阵，其中拆分后的子数组包括“分割点”。

我知道我可以使用numpy.split函数按索引进行拆分，并且我知道可以使用stride_tricks拆分数组以创建连续的重叠子集视图。

但是似乎stride_ticks仅适用于我们要将数组拆分为大小相等的子数组的情况。

最小示例，我可以执行以下操作：

>>> import numpy as np
>>> array = np.random.randint(0,10, (10,2))
>>> indices = np.array([2,3,8])
>>> array
array([[8, 1],
       [1, 0],
       [2, 0],
       [8, 8],
       [1, 6],
       [7, 8],
       [4, 4],
       [9, 4],
       [6, 7],
       [6, 4]])

>>> split_array = np.split(array, indices, axis=0)
>>> split_array
[array([[8, 1],
        [1, 0]]), 

 array([[2, 0]]), 

 array([[8, 8],
        [1, 6],
        [7, 8],
        [4, 4],
        [9, 4]]), 

 array([[6, 7],
        [6, 4]])]

但是我只是在split函数中寻找一个可以定义include_split_point=True的选项，这将给我这样的结果：

[array([[8, 1],
        [1, 0],
        [2, 0]]), 

 array([[2, 0],
        [8, 8]]), 

 array([[8, 8],
        [1, 6],
        [7, 8],
        [4, 4],
        [9, 4],
        [6, 7]]), 

 array([[6, 7],
        [6, 4]])]

Answer 1

使用重复的索引元素创建一个新数组

new_indices = np.zeros(array.shape[0], dtype = int)
new_indices[indices] = 1
new_indices += 1
new_array = np.repeat(array, new_indices, axis = 0)

更新索引以说明更改后的数组

indices = indices + np.arange(1, len(indices)+1)

照常使用索引分割

np.split(new_array, indices, axis = 0)

输出：

[array([[8, 1],
        [1, 0],
        [2, 0]]), 
 array([[2, 0],
        [8, 8]]), 
 array([[8, 8],
        [1, 6],
        [7, 8],
        [4, 4],
        [9, 4],
        [6, 7]]), 
 array([[6, 7],
        [6, 4]])]

如何通过索引拆分数组，其中拆分的子数组包含拆分点

1 个答案: