Question

从Python中的列表重新定位子列表的最快方法是什么？

假设我们有一个列表L = [a,b,c,d,e,f,g,h]，现在我想要[c,d,e]并将其放在列表中的g之后。我怎样才能快速？

编辑：换句话说，我想写一个函数：

从L中提取长度 n 的子列表L_sub，留下L_temp
将L_sub的项目在给定位置 i 插入L_temp

我猜的主要问题是如何尽快将列表插入列表。

Answer 1

我认为OP想要在现场这样做。

快速操作的关键是最小化列表的创建和列表的缩短/延长。这意味着我们必须努力始终对列表索引进行1：1分配，因此不需要L[i:i] = L[a:b]和L[a:b] = []。使用insert和pop的循环更糟糕，因为这会使列表缩短和延长很多次。连接列表也很糟糕，因为您首先必须为每个部分创建一个列表，然后为每个+创建一个更大和更大的连接列表。由于您希望“就地”执行此操作，因此您必须最终将生成的列表分配给L[:]。

    # items:   0 | 1   2   3 | 4   5   6   7 | 8   9
    #            a   span1   b     span2     c
    # pos:       1           4               8

    # Result:
    #          0 | 4   5   6   7 | 1   2   3 | 8   9
    #            a     span2         span2   c

让我们先做一个观察。如果a = start，b = end = start + length和c是插入位置，那么我们希望执行的操作是切换|标记并交换span1和span2。但如果b = start和c = end以及a是插入位置，那么我们也想要交换span1和span2。所以在我们的函数中，我们只处理必须交换的两个连续段。

我们无法完全避免制作新列表，因为我们需要在移动内容时存储重叠值。我们可以通过选择要存储到临时列表中的两个跨区中的哪一个来使列表尽可能短。

def inplace_shift(L, start, length, pos):
    if pos > start + length:
        (a, b, c) = (start, start + length, pos)
    elif pos < start:
        (a, b, c) = (pos, start, start + length)
    else:
        raise ValueError("Cannot shift a subsequence to inside itself")
    if not (0 <= a < b < c <= len(L)):
        msg = "Index check 0 <= {0} < {1} < {2} <= {3} failed."
        raise ValueError(msg.format(a, b, c, len(L)))

    span1, span2 = (b - a, c - b)
    if span1 < span2:
        tmp = L[a:b]
        L[a:a + span2] = L[b:c]
        L[c - span1:c] = tmp
    else:
        tmp = L[b:c]
        L[a + span2:c] = L[a:b]
        L[a:a + span2] = tmp

科斯似乎在他的时间上犯了一个错误，所以我在纠正了论点（从end和start计算length）之后用他的代码重新编写了这些错误，这些是结果，从最慢到最快。

Nick Craig-Wood: 100 loops, best of 3: 8.58 msec per loop 
vivek: 100 loops, best of 3: 4.36 msec per loop
PaulP.R.O. (deleted?): 1000 loops, best of 3: 838 usec per loop
unbeli: 1000 loops, best of 3: 264 usec per loop
lazyr: 10000 loops, best of 3: 44.6 usec per loop

我没有测试任何其他方法产生正确的结果。

Answer 2

我会用python子串

来做

def subshift(L, start, end, insert_at):
    temp = L[start:end]
    L = L[:start] + L[end:]
    return L[:insert_at] + temp + L[insert_at:]

print subshift(['a','b','c','d','e','f','g','h'], 2, 5, 4)

start和end指的是要剪切的子字符串的位置（结尾在通常的python样式中是非排他的。insert_at指的是插入子字符串的位置之后再次退回。

我认为如果子字符串超过一个字符或两个字符长度，那么这将比迭代中的任何解决方案更快，因为优秀的C代码正在进行繁重的工作。

Answer 3

让我们看看到目前为止我们得到了什么：

代码

def subshift(L, start, end, insert_at):
    'Nick Craig-Wood'
    temp = L[start:end]
    L = L[:start] + L[end:]
    return L[:insert_at] + temp + L[insert_at:]

# (promising but buggy, needs correction;
# see comments at unbeli's answer)
def unbeli(x, start, end, at): 
    'unbeli'
    x[at:at] = x[start:end]
    x[start:end] = []

def subshift2(L, start, length, pos):
    'PaulP.R.O.'
    temp = pos - length
    S = L[start:length+start]
    for i in range(start, temp):
        L[i] = L[i + length]
    for i in range(0,length):
        L[i + temp] = S[i]
    return L

def shift(L,start,n,i):
    'vivek'
    return L[:start]+L[start+n:i]+L[start:start+n]+L[i:]

基准：

> args = range(100000), 3000, 2000, 60000

> timeit subshift(*args)
100 loops, best of 3: 6.43 ms per loop

  > timeit unbeli(*args)
1000000 loops, best of 3: 631 ns per loop

> timeit subshift2(*args)
100 loops, best of 3: 11 ms per loop

> timeit shift(*args)
100 loops, best of 3: 4.28 ms per loop

Answer 4

这是一个替代的现场解决方案：

def movesec(l,srcIndex,n,dstIndex):
    if srcIndex+n>dstIndex: raise ValueError("overlapping indexes")
    for i in range(n):
        l.insert(dstIndex+1,l.pop(srcIndex))

    return l


print range(10)
print movesec(range(10),3,2,6)

输出：

[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]    # orginal
[0, 1, 2, 5, 6, 7, 3, 4, 8, 9]    # modified

Answer 5

>>> L = ['a','b','c','d','e','f','g','h']
>>> L[7:7] = L[2:5]
>>> L[2:5] = []
>>> L
['a', 'b', 'f', 'g', 'c', 'd', 'e', 'h']

Answer 6

def shift(L,start,n,i):
    return L[:start]+L[start+n:i]+L[start:start+n]+L[i:]

在python中重新定位子列表的最快方法

6 个答案: