Question

Here is my code:

from itertools import tee, islice, chain

def previous_and_next(some_iterable):
    prevs, items, nexts = tee(some_iterable, 3)
    prevs = chain([None], prevs)
    nexts = chain(islice(nexts, 1, None), [None])
    return zip(prevs, items, nexts)

fruits = ['watermelon', 'apple', 'apple', 'banana', 'kiwi', 'peach', 'apple',
          'pear', 'watermelon', 'apple', 'apple', 'orange', 'apple', 'grape']

nr_of_apples = 0
apples = []

for previous, item, nxt in previous_and_next(fruits):
    apple_indexes = [i for i, x in enumerate(fruits) if x == 'apple' and nxt != 'apple']
print(apple_indexes)

for i in apple_indexes:
    index = i - 1
    for previous, item, nxt in previous_and_next(fruits[index:]):
        if nxt != 'apple':
            break
        apples.append(nxt)

nr_of_apples = len(apples)

print(nr_of_apples)

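I am trying to use itertools to count how many times the word 'apple' appears in the list. I know this is a convoluted way of doing something that can be done much more simply, like this: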

for f in fruits:
    if f == 'apple':
        apples.append(f)

But the idea is to extend this code for broader use with Stanford CoreNLP's named entity recognition, so I am starting simple and building up from there.

The problem is that my code currently returns this:

[1, 2, 6, 9, 10, 12]  # indexes of the apples
8  # number of apples

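Clearly there are not 8 apples in the list, only 6. So my question is: how can I add a condition to the enumerate so that I only get the indexes of apples that do not follow another apple? The output should then look like this: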

[1, 6, 9, 12]
6

Answer 1

Try something like this:

In [160]: list_of_index = [i for i,j in enumerate(fruits) if j == 'apple']

In [161]: print list(set([min(i) if i[1] - i[0] == 1 else max(i) for i in zip(list_of_index,list_of_index[1:])]))
[1, 12, 6, 9]

In [162]: print fruits.count('apple')
6
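
As an alternative, here is a minimal Python 3 sketch that reuses the `previous_and_next` helper from the question and puts the condition directly in the comprehension over `enumerate`. The name `run_starts` is made up for illustration and is not from the question or the answer above. It keeps an index only when the item is the target and the previous item is not, so it should also cope with runs of three or more consecutive apples, where the pairwise min/max approach above would return extra indexes. It assumes the target value is never `None`, because the helper pads both ends with `None`.

```python
from itertools import chain, islice, tee


def previous_and_next(some_iterable):
    # Same helper as in the question: yields (previous, item, next) triples,
    # padding the first "previous" and the last "next" with None.
    prevs, items, nexts = tee(some_iterable, 3)
    prevs = chain([None], prevs)
    nexts = chain(islice(nexts, 1, None), [None])
    return zip(prevs, items, nexts)


def run_starts(items, target):
    # Hypothetical helper: indexes where `target` occurs and the element
    # just before it is not `target` (the first element of each run).
    # Assumes `target` is not None, since None is used as padding.
    return [
        i
        for i, (previous, item, _nxt) in enumerate(previous_and_next(items))
        if item == target and previous != target
    ]


fruits = ['watermelon', 'apple', 'apple', 'banana', 'kiwi', 'peach', 'apple',
          'pear', 'watermelon', 'apple', 'apple', 'orange', 'apple', 'grape']

apple_indexes = run_starts(fruits, 'apple')
print(apple_indexes)          # [1, 6, 9, 12]
print(fruits.count('apple'))  # 6
```

The total count still comes from `fruits.count('apple')`, as in the answer above; only the index selection changes.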

Using itertools with a condition on enumerate to get only certain list indexes (python)
