Question

Using the Stream API, how can I filter entries by their number of occurrences after collecting with groupingBy -> counting?

Given the following:

Map<Integer, Long> counts = Stream.of(1, 2, 2, 3, 4, 5, 5)
        .collect(groupingBy(n -> n, counting()));

How can this be filtered so that it only contains the keys 2 and 5?

I can use the following, but I would like to keep working with the stream without having to collect first.

Map<Integer, Long> counts = Stream.of(1, 2, 2, 3, 4, 5, 5)
        .collect(groupingBy(n -> n, counting()))
        .entrySet().stream()
        .filter(n -> n.getValue() > 1)
        .collect(toMap(Entry::getKey, Entry::getValue));

Answer 1

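There is no way around building a map or a similar data structure for an operation that depends on the values already seen. The same holds for, e.g., distinct, which looks like just another step in a chain of operations but cannot work without building a map (or map-like structure) internally.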

You can make the entire operation look like a single stream operation using

Map<Integer, Long> counts = Stream.of(1, 2, 2, 3, 4, 5, 5)
    .collect(collectingAndThen(groupingBy(n -> n, counting()),
       map -> map.entrySet().stream()
         .filter(n -> n.getValue() > 1)
         .collect(toMap(Entry::getKey, Entry::getValue))
    ));
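
If you need this in more than one place, the collectingAndThen combination above can be wrapped in a reusable collector. The following is only a sketch: the class name DuplicateCounts, the method countsAtLeast and its minCount parameter are made up for illustration and are not part of the JDK.

```java
import static java.util.stream.Collectors.collectingAndThen;
import static java.util.stream.Collectors.counting;
import static java.util.stream.Collectors.groupingBy;
import static java.util.stream.Collectors.toMap;

import java.util.Map;
import java.util.Map.Entry;
import java.util.function.Function;
import java.util.stream.Collector;
import java.util.stream.Stream;

class DuplicateCounts {
    // Hypothetical helper: counts every element, then keeps only the
    // entries whose count is at least minCount.
    static <T> Collector<T, ?, Map<T, Long>> countsAtLeast(long minCount) {
        // Step 1: the full counting map still has to be built first.
        Collector<T, ?, Map<T, Long>> countAll =
            groupingBy(Function.identity(), counting());
        // Step 2: the finisher filters the completed map.
        return collectingAndThen(countAll, map -> map.entrySet().stream()
            .filter(e -> e.getValue() >= minCount)
            .collect(toMap(Entry::getKey, Entry::getValue)));
    }

    public static void main(String[] args) {
        Map<Integer, Long> counts = Stream.of(1, 2, 2, 3, 4, 5, 5)
            .collect(countsAtLeast(2));
        System.out.println(counts); // contains the keys 2 and 5, each with count 2
    }
}
```

The call site then reads as a single collect step, although internally it is still the same two-phase process.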

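but this does not change how it works. Keep in mind that the count for every encountered value has to be remembered first, because the operation must reach the end of the stream before it can conclude that there are no further occurrences of a value.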

Note that sometimes a non-stream operation can look more concise:

Map<Integer, Long> counts = Stream.of(1, 2, 2, 3, 4, 5, 5)
    .collect(groupingBy(n -> n, HashMap::new, counting()));
counts.values().removeIf(count -> count < 2);
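
As far as I can tell, the HashMap::new argument matters here: the two-argument groupingBy makes no guarantee about the mutability of the map it returns, so asking for a HashMap explicitly makes the in-place removeIf safe. A self-contained sketch of the same idea, where the class RemoveIfCounts and the method countsAtLeast are hypothetical names chosen for this example:

```java
import static java.util.stream.Collectors.counting;
import static java.util.stream.Collectors.groupingBy;

import java.util.HashMap;
import java.util.Map;
import java.util.stream.Stream;

class RemoveIfCounts {
    // Hypothetical helper: counts every element, then removes the
    // entries whose count is below minCount from the map in place.
    static <T> Map<T, Long> countsAtLeast(Stream<T> items, long minCount) {
        // HashMap::new requests a map type that is known to be mutable.
        Map<T, Long> counts =
            items.collect(groupingBy(t -> t, HashMap::new, counting()));
        // values() is a view, so removing a value removes its entry.
        counts.values().removeIf(count -> count < minCount);
        return counts;
    }

    public static void main(String[] args) {
        System.out.println(countsAtLeast(Stream.of(1, 2, 2, 3, 4, 5, 5), 2));
    }
}
```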

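If you are interested in processing the items themselves within a (parallel capable) Stream and do not care about the actual number of occurrences, here is a simple solution: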

ConcurrentHashMap<Integer, Integer> counts = new ConcurrentHashMap<>();
Stream.of(1, 2, 2, 3, 4, 5, 5)
      .filter(i -> counts.merge(i, 1, Integer::sum) == 2)
      .forEach(System.out::println);

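This applies the subsequent operations, up to the terminal operation, as soon as the second occurrence of an item is encountered, without needing to process all items or wait for the end of the stream. It also works together with parallel execution and/or short-circuiting operations such as limit.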
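
To illustrate the short-circuiting point, here is a minimal sketch that combines the same merge-based filter with limit. The class FirstDuplicates and the method firstDuplicates are hypothetical names. Note that the predicate is deliberately stateful: the atomic merge lets exactly one occurrence per duplicated value pass, but on a parallel stream which occurrence that is may vary.

```java
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.stream.Collectors;
import java.util.stream.Stream;

class FirstDuplicates {
    // Hypothetical helper: returns at most max elements that occur at
    // least twice, each reported once, at its second occurrence.
    static <T> List<T> firstDuplicates(Stream<T> items, long max) {
        Map<T, Integer> seen = new ConcurrentHashMap<>();
        return items
            // merge is atomic and returns the updated count for this item
            .filter(item -> seen.merge(item, 1, Integer::sum) == 2)
            // limit can stop the pipeline before the source is exhausted
            .limit(max)
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(firstDuplicates(Stream.of(1, 2, 2, 3, 4, 5, 5), 10));
        // An infinite source: 0, 1, 2, 0, 1, 2, ... still terminates,
        // because limit ends the stream once two duplicates were found.
        Stream<Integer> cycle = Stream.iterate(0, i -> i + 1).map(i -> i % 3);
        System.out.println(firstDuplicates(cycle, 2));
    }
}
```

The second call would never return with the groupingBy approach, since collecting the counts requires reaching the end of the stream.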
