SQL Server根据分组在不同行和总和之间查找datediff

时间:2018-01-08 15:45:21

标签: sql sql-server

我希望在此处扩展接受的答案:

SQL Server find datediff between different rows, sum

扩展数据集:

ID  | Time                    |OnOffSite| UserName   | Reason
------------------------------------------------------
123 | 2011-10-25 09:00:00.000 | on      | Bloggs Joe | NULL
124 | 2011-10-25 12:00:00.000 | off     | Bloggs Joe | Shift
125 | 2011-10-25 13:00:00.000 | on      | Bloggs Joe | NULL
126 | 2011-10-25 17:00:00.000 | off     | Bloggs Joe | Travel
127 | 2011-10-25 09:00:00.000 | on      | Jonesy Ian | NULL
128 | 2011-10-25 10:00:00.000 | on      | Jonesy Ian | NULL
129 | 2011-10-25 11:00:00.000 | off     | Jonesy Ian | End Shift
130 | 2011-10-25 12:00:00.000 | on      | Jonesy Ian | NULL
131 | 2011-10-25 15:00:00.000 | off     | Jonesy Ian | OverTime

我尝试通过新专栏扩展提供的答案分组,但没有运气:

-- =====================
-- sample data
-- =====================
declare @t table
(
    ID int,
    Time datetime,
    OnOffSite varchar(3),
    UserName varchar(50),
    Reason varchar(50)
)

insert into @t values(123, '2011-10-25 09:00:00.000', 'on', 'Bloggs Joe', 'NULL')
insert into @t values(124, '2011-10-25 12:00:00.000', 'off', 'Bloggs Joe', 'Shift')
insert into @t values(125, '2011-10-25 13:00:00.000', 'on', 'Bloggs Joe', 'NULL')
insert into @t values(126, '2011-10-25 17:00:00.000', 'off', 'Bloggs Joe', 'Travel')
insert into @t values(127, '2011-10-25 09:00:00.000', 'on', 'Jonesy Ian', 'NULL')
insert into @t values(128, '2011-10-25 10:00:00.000', 'on', 'Jonesy Ian', 'NULL')
insert into @t values(129, '2011-10-25 11:00:00.000', 'off', 'Jonesy Ian', 'Travel')
insert into @t values(130, '2011-10-25 12:00:00.000', 'on', 'Jonesy Ian', '')
insert into @t values(131, '2011-10-25 15:00:00.000', 'off', 'Jonesy Ian', 'Shift')

-- =====================
-- solution
-- =====================
select
    UserName, Reason,  diffinhours = DATEDIFF(hh, timeon, timeoff)
from
(
    select
        UserName,
        Reason, 
        timeon = max(case when k = 2 and OnOffSite = 'on' then Time end),
        timeoff = max(case when k = 1 and OnOffSite = 'off' then Time end)
    from
    (
        select
            ID,
            UserName,
            Reason, 
            OnOffSite,
            Time,
            rn = ROW_NUMBER() over(partition by username, Reason,  order by id)
        from
        (
            select
                ID,
                UserName,
                Reason, 
                OnOffSite,
                Time,
                rn2 = case OnOffSite 
                -- '(..order by id)' takes earliest 'on' in the sequence of 'on's
                -- to take the latest use '(...order by id desc)'
                when 'on' then 
                    ROW_NUMBER() over(partition by UserName, Reason,  OnOffSite, rn1 order by id)
                -- '(... order by id desc)' takes the latest 'off' in the sequence of 'off's
                -- to take the earliest use '(...order by id)'
                when 'off' then
                    ROW_NUMBER() over(partition by UserName, Reason,  OnOffSite, rn1 order by id desc)
                end,
                rn1
            from
            (
                select
                    *,
                    rn1 = ROW_NUMBER() over(partition by username, Reason,  order by id) +
                        ROW_NUMBER() over(partition by username, Reason, onoffsite order by id desc)
                from @t
            ) t
        ) t
        where rn2 = 1
    ) t1
    cross join
    (
        select k = 1 union select k = 2
    ) t2
    group by UserName, Reason, rn + k
) t
where timeon is not null or timeoff is not null
order by username

我只是在OnOffSite =" off"时记录了一个原因。我相信问题是当OnOffSite =" on"时,Reason的值为NULL。但在这种情况下,我只想按时间戳分组并使用" off"原因作为价值。

这是SQL Server 2012

1 个答案:

答案 0 :(得分:1)

从分组中移除Reason并将第32行的Reason替换为reason = max(case when OnOffSite = 'on' then null else Reason end),以便代码如下所示:

[...................................]

-- =====================
-- solution
-- =====================
select
    UserName, timeon, timeoff, diffinhours = DATEDIFF(hh, timeon, timeoff), reason
from
(
    select
        UserName,
        reason = max(case when OnOffSite = 'on' then null else Reason end),
        timeon = max(case when k = 2 and OnOffSite = 'on' then Time end),
        timeoff = max(case when k = 1 and OnOffSite = 'off' then Time end)
    from
    (
        select
            ID,
            UserName,
            OnOffSite,
            Time,
            rn = ROW_NUMBER() over(partition by username order by id),
            Reason

[...................................]

将max与字符串一起使用并不完全是本书,但它适用于这种情况。