EDITED

Question

目的是在给定整数列表的情况下找到增加/单调数字的组。结果组中的每个项目必须与前一个项目的增量+1相同

给出一个输入：

x = [7, 8, 9, 10, 6, 0, 1, 2, 3, 4, 5]

我需要找到数量不断增加的群体并实现：

increasing_numbers = [(7,8,9,10), (0,1,2,3,4,5)]

最终还有越来越多的数字：

len(list(chain(*increasing_numbers)))

还有小组的镜头：

increasing_num_groups_length = [len(i) for i in increasing_numbers]

我已尝试以下方法来获取增加的数字：

>>> from itertools import tee, chain
>>> def pairwise(iterable): 
...     a, b = tee(iterable)
...     next(b, None)
...     return zip(a, b)
... 
>>> x = [8, 9, 10, 11, 7, 1, 2, 3, 4, 5, 6]
>>> set(list(chain(*[(i,j) for i,j in pairwise(x) if j-1==i])))
set([1, 2, 3, 4, 5, 6, 8, 9, 10, 11])
>>> len(set(list(chain(*[(i,j) for i,j in pairwise(x) if j-1==i]))))
10

但是我无法保持秩序和数量不断增加的群体。

如何实现increasing_numbers整数元组组以及increasing_num_groups_length？

此外，是否有此类/类似问题的名称？

EDITED

我已经提出了这个解决方案，但它非常冗长，我确信有更简单的方法来实现increasing_numbers输出：

>>> from itertools import tee, chain
>>> def pairwise(iterable): 
...     a, b = tee(iterable)
...     next(b, None)
...     return zip(a, b)
... 
>>> x = [8, 9, 10, 11, 7, 1, 2, 3, 4, 5, 6]
>>> boundary =  iter([0] + [i+1 for i, (j,k) in enumerate(pairwise(x)) if j+1!=k] + [len(x)])
>>> [tuple(x[i:next(boundary)]) for i in boundary]
[(8, 9, 10, 11), (1, 2, 3, 4, 5, 6)]

有更多的pythonic / less verbose方法吗？

另一个输入/输出示例：

[IN]：

[17,17,19,20,21,22,0,1,2,2,4,5,6,7,8,9,10,11,12,13， 14,14,14,28,29,30,31,32,33,34,35,36,40]

[OUT]：

[（19,20,21,22），（0,1,2），（4,5,6,7,8,9,10,11,12,13,14），（28,29,30,31,32,33,34,35,36）]

Answer 1

修改：

这是代码高尔夫解决方案（142个字符）：

def f(x):s=[0]+[i for i in range(1,len(x)) if x[i]!=x[i-1]+1]+[len(x)];return [x[j:k] for j,k in [s[i:i+2] for i in range(len(s)-1)] if k-j>1]

扩展版本：

def igroups(x): s = [0] + [i for i in range(1, len(x)) if x[i] != x[i-1] + 1] + [len(x)] return [x[j:k] for j, k in [s[i:i+2] for i in range(len(s)-1)] if k - j > 1]

评论版：

def igroups(x): # find the boundaries where numbers are not consecutive boundaries = [i for i in range(1, len(x)) if x[i] != x[i-1] + 1] # add the start and end boundaries boundaries = [0] + boundaries + [len(x)] # take the boundaries as pairwise slices slices = [boundaries[i:i + 2] for i in range(len(boundaries) - 1)] # extract all sequences with length greater than one return [x[start:end] for start, end in slices if end - start > 1]

原始解决方案：

不确定这是否算作“pythonic”或“不太冗长”：

def igroups(iterable): items = iter(iterable) a, b = None, next(items, None) result = [b] while b is not None: a, b = b, next(items, None) if b is not None and a + 1 == b: result.append(b) else: if len(result) > 1: yield tuple(result) result = [b] print(list(igroups([]))) print(list(igroups([0, 0, 0]))) print(list(igroups([7, 8, 9, 10, 6, 0, 1, 2, 3, 4, 5]))) print(list(igroups([8, 9, 10, 11, 7, 1, 2, 3, 4, 5, 6]))) print(list(igroups([9, 1, 2, 3, 1, 1, 2, 3, 5])))

输出：

[] [] [(7, 8, 9, 10), (0, 1, 2, 3, 4, 5)] [(8, 9, 10, 11), (1, 2, 3, 4, 5, 6)] [(1, 2, 3), (1, 2, 3)]

Answer 2

使用itertools和numpy的几种不同方式：

from itertools import groupby, tee, cycle

x = [17, 17, 19, 20, 21, 22, 0, 1, 2, 2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 14, 14, 28, 29, 30, 31, 32, 33, 34, 35,
     36, 1, 2, 3, 4,34,54]


def sequences(l):
    x2 = cycle(l)
    next(x2)
    grps = groupby(l, key=lambda j: j + 1 == next(x2))
    for k, v in grps:
        if k:
            yield tuple(v) + (next((next(grps)[1])),)


print(list(sequences(x)))

[(19, 20, 21, 22), (0, 1, 2), (4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14), (28, 29, 30, 31, 32, 33, 34, 35, 36), (1, 2, 3, 4)]

或者使用python3并从中获取：

def sequences(l):
    x2 = cycle(l)
    next(x2)
    grps = groupby(l, key=lambda j: j + 1 == next(x2))
    yield from (tuple(v) + (next((next(grps)[1])),) for k,v in grps if k)

print(list(sequences(x)))

使用我的回答here与numpy.split：

的变体

out = [tuple(arr) for arr in np.split(x, np.where(np.diff(x) != 1)[0] + 1) if arr.size > 1]

print(out)

[(19, 20, 21, 22), (0, 1, 2), (4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14), (28, 29, 30, 31, 32, 33, 34, 35, 36), (1, 2, 3, 4)]

与ekhumoro的答案类似：

def sequences(x):
    it = iter(x)
    prev, temp = next(it), []
    while prev is not None:
        start = next(it, None)
        if prev + 1 == start:
            temp.append(prev)
        elif temp:
            yield tuple(temp + [prev])
            temp = []
        prev = start

获取长度和元组：

def sequences(l):
    x2 = cycle(l)
    next(x2)
    grps = groupby(l, key=lambda j: j + 1 == next(x2))
    for k, v in grps:
        if k:
            t = tuple(v) + (next(next(grps)[1]),)
            yield t, len(t)


def sequences(l):
    x2 = cycle(l)
    next(x2)
    grps = groupby(l, lambda j: j + 1 == next(x2))
    yield from ((t, len(t)) for t in (tuple(v) + (next(next(grps)[1]),)
                                      for k, v in grps if k))



def sequences(x):
        it = iter(x)
        prev, temp = next(it), []
        while prev is not None:
            start = next(it, None)
            if prev + 1 == start:
                temp.append(prev)
            elif temp:
                yield tuple(temp + [prev]), len(temp) + 1
                temp = []
            prev = start

所有三个的输出都是相同的：

[((19, 20, 21, 22), 4), ((0, 1, 2), 3), ((4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14), 11)
, ((28, 29, 30, 31, 32, 33, 34, 35, 36), 9), ((1, 2, 3, 4), 4)]

Answer 3

我认为最易于维护的解决方案是简化：

def group_by(l):
    res = [[l[0]]]
    for i in range(1, len(l)):
        if l[i-1] < l[i]:
            res[-1].append(l[i])
        else:
            res.append([l[i]])
    return res

此解决方案不会过滤掉单个元素序列，但可以轻松实现。另外，这具有O（n）复杂性。如果你愿意，你也可以把它变成一台发电机。

通过可维护我的意思是代码不是300字符的单行，有一些复杂的表达式。那么也许你会想要使用Perl :)。至少你将了解一年后该功能的表现。

>>> x = [7, 8, 9, 10, 6, 0, 1, 2, 3, 4, 5]
>>> print(group_by(x))
[[7, 8, 9, 10], [6], [0, 1, 2, 3, 4, 5]]

Answer 4

def igroups(L):
    R=[[]]
    [R[-1].append(L[i]) for i in range(len(L)) if (L[i-1]+1==L[i] if L[i-1]+1==L[i] else R.append([L[i]]))]
    return [P for P in R if len(P)>1]


tests=[[],
    [0, 0, 0],
    [7, 8, 9, 10, 6, 0, 1, 2, 3, 4, 5],
    [8, 9, 10, 11, 7, 1, 2, 3, 4, 5, 6],
    [9, 1, 2, 3, 1, 1, 2, 3, 5],
    [4,3,2,1,1,2,3,3,4,3],
    [1, 4, 3],
    [1],
    [1,2],
    [2,1]
    ]
for L in tests:
    print(L)
    print(igroups(L))
    print("-"*10)

输出以下内容：

[]
[]
----------
[0, 0, 0]
[]
----------
[7, 8, 9, 10, 6, 0, 1, 2, 3, 4, 5]
[[7, 8, 9, 10], [0, 1, 2, 3, 4, 5]]
----------
[8, 9, 10, 11, 7, 1, 2, 3, 4, 5, 6]
[[8, 9, 10, 11], [1, 2, 3, 4, 5, 6]]
----------
[9, 1, 2, 3, 1, 1, 2, 3, 5]
[[1, 2, 3], [1, 2, 3]]
----------
[4, 3, 2, 1, 1, 2, 3, 3, 4, 3]
[[1, 2, 3], [3, 4]]
----------
[1, 4, 3]
[]
----------
[1]
[]
----------
[1, 2]
[[1, 2]]
----------
[2, 1]
[]
----------

EDIT 我使用itertools.groupby的第一次尝试是失败，对不起。

Answer 5

如果两个连续的数字增加一个，我会形成Certainty = Pm^2/(Pm^2 + Pb^2 + Pc^2)（list）group个数字。

当不增加且tuples（list）非空时，我将其解包并再次group以重建被{{打破的一对序列1}}。我使用zip理解来消除重复的数字。

zip

使用set更简单;

set

Answer 6

使用itertools.groupby，可以使用单行完成从L的相邻和增加的连续项的子列表中分割整数列表L的问题。然而，我不知道它是如何被认为是pythonic;）

以下是一些简单测试的代码：

[编辑：现在子序列正在增加增加1 ，我第一次错过了这一点。]

from itertools import groupby

def f(i):
    return  L[i-1]+1==L[i]


def igroups(L):
    return [[L[I[0]-1]]+[L[i] for i in I] for I in [I for (v,I) in [(k,[i for i in list(g)]) for (k, g) in groupby(range(1, len(L)), f)] if v]]

输出：

tests=[
    [0, 0, 0, 0],
    [7, 8, 9, 10, 6, 0, 1, 2, 3, 4, 5],
    [8, 9, 10, 11, 7, 1, 2, 3, 4, 5, 6],
    [9, 1, 2, 3, 1, 1, 2, 3, 5],
    [4,3,2,1,1,2,3,3,4,3],
    [1, 4, 3],
    [1],
    [1,2, 2],
    [2,1],
    [0, 0, 0, 0, 2, 5, 5, 8],
    ]
for L in tests:
    print(L)
    print(igroups(L))
    print('-'*10)


[0, 0, 0, 0]
[]
----------
[7, 8, 9, 10, 6, 0, 1, 2, 3, 4, 5]
[[7, 8, 9, 10], [0, 1, 2, 3, 4, 5]]
----------
[8, 9, 10, 11, 7, 1, 2, 3, 4, 5, 6]
[[8, 9, 10, 11], [1, 2, 3, 4, 5, 6]]
----------
[9, 1, 2, 3, 1, 1, 2, 3, 5]
[[1, 2, 3], [1, 2, 3]]
----------
[4, 3, 2, 1, 1, 2, 3, 3, 4, 3]
[[1, 2, 3], [3, 4]]
----------
[1, 4, 3]
[]
----------
[1]
[]
----------
[1, 2, 2]
[[1, 2]]
----------
[2, 1]
[]
----------
[0, 0, 0, 0, 2, 5, 5, 8]
[]
----------

一些解释。如果您“展开”代码，逻辑会更加明确：

from itertools import groupby

def f(i):
    return L[i]==L[i-1]+1

def igroups(L):
    monotonic_states = [(k,list(g)) for (k, g) in groupby(range(1, len(L)), f)]
    increasing_items_indices = [I for (v,I) in monotonic_states if v]
    print("\nincreasing_items_indices ->", increasing_items_indices, '\n')
    full_increasing_items= [[L[I[0]-1]]+[L[i] for i in I] for I in increasing_items_indices]
    return full_increasing_items

L= [2, 8, 4, 5, 6, 7, 8, 5, 9, 10, 11, 12, 25, 26, 27, 42, 41]
print(L)
print(igroups(L))

输出：

[2, 8, 4, 5, 6, 7, 8, 5, 9, 10, 11, 12, 25, 26, 27, 42, 41]

increasing_items_indices -> [[3, 4, 5, 6], [9, 10, 11], [13, 14]]

[[4, 5, 6, 7, 8], [9, 10, 11, 12], [25, 26, 27]]

我们需要一个键函数f，用于将项目与给定列表中的前一个项目进行比较。现在，重点是具有键函数groupby的{{1}}函数提供了一个元组f，其中S表示来自初始列表的邻近索引，其中状态(k, S)是常量，状态由f的值给出：如果k是k，则True表示增加（按1）项指数其他非增加项目指数。（事实上，如上例所示，列表S不完整且缺少第一项）。

我还进行了一些带有一百万个项目列表的随机测试：S函数总是返回正确的响应，但比天真的实现慢4倍！更简单更容易，更快;）

感谢alvas提出的问题，它给了我很多乐趣！

Answer 7

一个（非常）简单的实现：

x = [7, 8, 9, 10, 6, 0, 1, 2, 3, 4, 5]
result = []
current = x[0]
temp = []
for i in xrange(1, len(x)):
    if (x[i] - current == 1):
        temp.append( x[i] )
    else:
         if (len(temp) > 1):
             result.append(temp)
         temp = [ x[i] ]
    current = x[i]
result.append(temp)

你会得到[ [7, 8, 9, 10], [0, 1, 2, 3, 4, 5] ]。在那里，您可以按[ len(x) for x in result ]和sum( len(x) for x in result)的总数来获得增加的数字。

Answer 8

我认为这很有效。它并不花哨，但很简单。它构造了一个开始列表sl和一个结束列表el，它应该始终具有相同的长度，然后使用它们索引到x：

def igroups(x):
    sl = [i for i in range(len(x)-1)
          if (x == 0 or x[i] != x[i-1]+1) and x[i+1] == x[i]+1]

    el = [i for i in range(1, len(x))
          if x[i] == x[i-1]+1 and (i == len(x)-1 or x[i+1] != x[i]+1)]

    return [x[sl[i]:el[i]+1] for i in range(len(sl))]

在列表中查找数量不断增加的组

EDITED

8 个答案: