Java 8流多个分组依据

时间:2019-01-10 15:38:35

标签: java java-8 java-stream

我有这样的温度记录

dt        |AverageTemperature |AverageTemperatureUncertainty|City   |Country |Latitude|Longitude
----------+-------------------+-----------------------------+-------+--------+--------+---------
1963-01-01|-5.417000000000002 |0.5                          |Karachi|Pakistan|57.05N  |10.33E  
1963-02-01|-4.7650000000000015|0.328                        |Karachi|Pakistan|57.05N  |10.33E  
1964-01-01|-5.417000000000002 |0.5                          |Karachi|Pakistan|57.05N  |10.33E  
1964-02-01|-4.7650000000000015|0.328                        |Karachi|Pakistan|57.05N  |10.33E  
1965-01-01|11.417000000000002 |0.5                          |Karachi|Pakistan|57.05N  |10.33E 
1965-02-01|12.7650000000000015|0.328                        |Karachi|Pakistan|57.05N  |10.33E

我必须将其解析为POJO并根据以下问题陈述计算平均增量:

  

使用Streams API计算平均年温度变化量   对于每个国家。计算三角洲1900年的平均温度   从1901年的平均温度中减去以获得   特定城市从1900年到1901年之间的差额。所有的平均值   这些增量是城市的年平均温度增量。的   一个国家所有城市的平均值就是一个国家的平均值。

我的温带POJO看起来像是有吸气剂和吸气剂

public class Temperature {
    private java.util.Date date;
    private double averageTemperature;
    private double averageTemperatureUncertainty;
    private String city;
    private String country;
    private String latitude;
    private String longitude;
}

我已经列出了温度列表,因为要使用流来解决此问题。

要计算增量,我尝试使用以下流,但由于要计算平均国家/地区增量,我仍然无法计算实际的增量,因此我已经对国家,城市和日期进行了分组。

Map<String, Map<String, Map<Integer, Double>>> countriesMap = this.getTemperatures().stream()
                .sorted(Comparator.comparing(Temperature::getDate))
                .collect(Collectors.groupingBy(Temperature::getCountry,
                        Collectors.groupingBy(Temperature::getCity,
                        Collectors.groupingBy
                                (t -> {
                                            Calendar calendar = Calendar.getInstance();
                                            calendar.setTime(t.getDate());
                                            return calendar.get(Calendar.YEAR);
                                        }, 
                        Collectors.averagingDouble(Temperature::getAverageTemperature)))));

为了计算增量,我们必须计算差异 Map<Integer, Double>

为了计算差异,我想出了以下代码,但无法将以下代码与上面的代码连接

Stream.of(10d, 20d, 10d) //this is sample data that I that I get in `Map<Integer, Double>` of countriesMap
        .map(new Function<Double, Optional<Double>>() {
            Optional<Double> previousValue = Optional.empty();
            @Override
            public Optional<Double> apply(Double current) {
                Optional<Double> value = previousValue.map(previous -> current - previous);
                previousValue = Optional.of(current);
                return value;
            }
        })
        .filter(Optional::isPresent)
        .map(Optional::get)
        .forEach(System.out::println);

如何一次性使用流来计算增量,或者如何在countriesMap上执行流操作以计算增量并实现所提到的问题陈述。

1 个答案:

答案 0 :(得分:4)

要将问题陈述缩减为一个较小的块,您可以考虑使用的另一种方法是解析year温度,并为它们计算增量,然后进一步average对其进行解析。不过,对于您的问题中内部Map<Integer, Double>中所有类型Map的值,都必须这样做。看起来像这样:

Map<Integer, Double> unitOfWork = new HashMap<>(); // innermost map you've attained ('yearToAverageTemperature' map)
unitOfWork = unitOfWork.entrySet()
        .stream()
        .sorted(Map.Entry.comparingByKey())
        .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue, (e1, e2) -> e1, LinkedHashMap::new));
// the values sorted based on the year from a sorted map
List<Double> srtedValPerYear = new ArrayList<>(unitOfWork.values());
// average of deltas from the complete list 
double avg = IntStream.range(0, srtedVal.size() - 1)
        .mapToDouble(i -> (srtedVal.get(i + 1) - srtedVal.get(i)))
        .average().orElse(Double.NaN);

要进一步注意,这只是City的{​​{1}}记录的平均值,您将不得不遍历所有<Year, AverageTemperature>键集,并且类似地遍历所有{{ 1}}键集,以彻底找出此类平均值。

将此工作单元进一步移动到方法中,遍历整个地图,这可以通过以下方式实现:

City

其中Country是遵循的方法

// The average of all cities in a country is the average of a country.
AtomicReference<Double> countryValAvg = new AtomicReference<>(0.0);
countriesMap.forEach((country, cityMap) -> {
    // The average of all these deltas is the average annual temperature delta for a city.
    AtomicReference<Double> cityAvgTemp = new AtomicReference<>((double) 0);
    cityMap.forEach((city, yearMap) -> cityAvgTemp.set(cityAvgTemp.get() + averagePerCity(yearMap)));
    double avgAnnualTempDeltaPerCity = cityAvgTemp.get() / cityMap.size();

    countryValAvg.set(countryValAvg.get() + avgAnnualTempDeltaPerCity);
});
System.out.println(countryValAvg.get() / countriesMap.size());

注意 :上面的代码可能缺少验证,它只是为了提供一个想法,如何将完整的问题分解成较小的部分然后加以解决。

Edit1 :其中could be improved further as

averagePerCity

Edit2 :进一步

double averagePerCity(Map<Integer, Double> unitOfWork) {
    unitOfWork = unitOfWork.entrySet()
            .stream()
            .sorted(Map.Entry.comparingByKey())
            .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue, (e1, e2) -> e1, LinkedHashMap::new));
    List<Double> srtedVal = new ArrayList<>(unitOfWork.values());
    return IntStream.range(0, srtedVal.size() - 1)
            .mapToDouble(i -> (srtedVal.get(i + 1) - srtedVal.get(i)))
            .average().orElse(Double.NaN);
}