First and last value aggregate functions in PostgreSQL that work correctly with NULL values

Time: 2013-12-03 08:07:46

Tags: sql postgresql aggregate-functions

I know there are aggregate functions for getting the last and first value of rows in PostgreSQL.

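My problem is that they do not work the way I need, and I could use the help of a PostgreSQL wizard. I am using PostgreSQL 9.2, in case that version makes it easier to provide a solution.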

The query

select v.id, v.active, v.reg_no, p.install_date, p.remove_date 
from vehicle v 
    left join period p on (v.id = p.car_id) 
where v.id = 1 
order by v.id, p.install_date asc

returns 6 rows:

id, active, reg_no, install_date, remove_date
1, TRUE, something, 2008-08-02 11:13:39, 2009-02-09 10:32:32
....
1, TRUE, something, 2010-08-15 21:16:40, 2012-08-25 07:44:30
1, TRUE, something, 2012-09-10 17:05:12, NULL

But when I use the aggregate query:

select max(id) as id, last(active) as active, first(install_date) as install_date, last(remove_date) as remove_date 
from (
    select v.id, v.active, v.reg_no, p.install_date, p.remove_date 
    from vehicle v 
      left join period p on (v.id = p.car_id) 
    where v.id = 1 
    order by v.id, p.install_date asc
) as bar 
group by id

then I get

id, active, install_date, remove_date
1, TRUE, 2008-08-02 11:13:39, 2012-08-25 07:44:30

instead of

id, active, install_date, remove_date
1, TRUE, 2008-08-02 11:13:39, NULL

as I expected.

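Is it possible to change the aggregate functions somehow so that they yield NULL when the value in the last row is NULL, rather than the last existing (non-NULL) value?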

EDIT1

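Roman Pekar offered an alternative solution to my problem, but it does not fit my needs. The reason is that I simplified the original query, and the query I actually run is more complex. I realize there may be alternative solutions to my problem, which is why I am updating the post to include the original, more complex query. Here it is: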

select partner_id, sum(active) as active, sum(installed) as installed, sum(removed) as removed 
from (
    select 
    pc.partner_id as partner_id, 
    v.id, 
    CASE WHEN v.active = TRUE THEN 1 ELSE 0 END as active, 
    CASE WHEN first(p.install_date) BETWEEN '2013-12-01' AND '2014-01-01' THEN 1 ELSE 0 END as installed,
    CASE WHEN last(p.remove_date) BETWEEN '2013-12-01' AND '2014-01-01' THEN 1 ELSE 0 END as removed 
    from vehicle v 
        left join period p on (v.id = p.car_id) 
        left join partner_clients pc on (pc.account_id = v.client_id) 
    group by pc.partner_id, v.id, v.active
) as foo group by partner_id

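As you can see, I actually need the first and last values for several vehicles rather than one, and in the end I sum up the counts for those vehicles (grouped by partner_id in the query above).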

/ EDIT1

2 answers:

Answer 0 (score: 6)

You can use the window functions lead() and lag() to check for the first and last record, for example:

select
    max(a.id) as id,
    max(a.first) as first,
    max(a.last) as last
from (
    select
         v.id,
         case when lag(v.id) over(order by v.id, p.install_date) is null then p.install_date end as first,
         case when lead(v.id) over(order by v.id, p.install_date) is null then p.remove_date end as last
    from vehicle v 
       left join period p on (v.id = p.car_id) 
    where v.id = 1 
) as a

sql fiddle demo
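The query above is restricted to v.id = 1. For the multi-vehicle case described in EDIT1, one possible adaptation is to partition the window by vehicle and group the outer query by id. This is only a sketch: it assumes the same vehicle and period tables as in the question and that v.id is never NULL, and the aliases first_install and last_remove are names chosen here for illustration.

```sql
-- Sketch: first install_date and last remove_date per vehicle.
-- Within each vehicle's partition, lag() is NULL only on the first row
-- and lead() is NULL only on the last row (assuming v.id is never NULL).
select
    a.id,
    max(a.first_install) as first_install,
    max(a.last_remove)   as last_remove
from (
    select
        v.id,
        case when lag(v.id)  over w is null then p.install_date end as first_install,
        case when lead(v.id) over w is null then p.remove_date  end as last_remove
    from vehicle v
        left join period p on (v.id = p.car_id)
    window w as (partition by v.id order by p.install_date)
) as a
group by a.id;
```

Because only the last row of each partition contributes a value to last_remove, max() returns NULL when that row's remove_date is NULL, which is the behaviour asked for in the question.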

Answer 1 (score: 2)

Thanks to Damien, I went and read the PostgreSQL documentation on creating functions (source), and adjusted the function, changing it from:

CREATE OR REPLACE FUNCTION public.last_agg ( anyelement, anyelement )
RETURNS anyelement LANGUAGE sql IMMUTABLE STRICT AS $$
        SELECT $2;
$$;

CREATE AGGREGATE public.last (
        sfunc    = public.last_agg,
        basetype = anyelement,
        stype    = anyelement
);

to:

CREATE OR REPLACE FUNCTION public.last_agg ( anyelement, anyelement )
RETURNS anyelement LANGUAGE sql IMMUTABLE CALLED ON NULL INPUT AS $$
        SELECT $2;
$$;

CREATE AGGREGATE public.last (
        sfunc    = public.last_agg,
        basetype = anyelement,
        stype    = anyelement
);

That seems to have solved my problem.

Thanks for reading.
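For context on why this helps: when an aggregate's transition function is declared STRICT, PostgreSQL does not call it for rows whose input is NULL and simply keeps the previous state, so a STRICT last() returns the last non-NULL value. With CALLED ON NULL INPUT the function runs for every row, so a trailing NULL becomes the result. A quick way to check this, as a sketch with made-up sample values, and using ORDER BY inside the aggregate call so the result does not depend on subquery ordering:

```sql
-- Hypothetical sample data: three values in a known order, the last one NULL.
-- With the CALLED ON NULL INPUT version of last_agg this should return NULL;
-- with the original STRICT version it should return 20.
select public.last(x order by ord) as last_x
from (values (1, 10), (2, 20), (3, null::int)) as t(ord, x);

-- Applied per vehicle, assuming the tables from the question and assuming
-- a public.first aggregate defined analogously to public.last
-- (it is not shown in this answer).
select
    v.id,
    public.first(p.install_date order by p.install_date) as first_install,
    public.last(p.remove_date order by p.install_date)   as last_remove
from vehicle v
    left join period p on (v.id = p.car_id)
group by v.id;
```

Note that the same one-line change would not work for a first() aggregate whose transition function is SELECT $1: without an initial condition the state starts as NULL, and a non-strict function would keep returning that NULL state for every row.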