Question

我有这种格式的旧数据：

ID    DT          NUM 
1     6-1-2012    2
1     6-2-2012    2
1     6-3-2012    4
1     6-4-2012    4
1     6-5-2012    8
1     6-6-2012    8
1     6-7-2012    8
1     6-8-2012    16
1     6-9-2012    2
1     6-10-2012   2

我需要它看起来像这样：

ID    START_DT    END_DT      NUM
1     6-1-2012    6-2-2012    2 
1     6-3-2012    6-4-2012    4
1     6-5-2012    6-7-2012    8 
1     6-8-2012    6-8-2012    16
1     6-9-2012    6-10-2012   2

这是我能够快速提出的数据的最佳示例。如果我意外地包含一些误解，我想澄清一下。

规则：

ID：这确实会发生变化，最终会被分组，为了让事情变得简单，我的例子中也是如此。
DT：我得到一个原始日期时间，在实际数据中时间部分确实变化
START_DT，END_DT：我需要从原始DT中获取这些列
NUM：这只是一个更改的整数，可以重新发送每个ID

编辑：这很尴尬...... （必须有更好的答案）... 我还没有用很多条件测试过这个但从一开始它看起来没问题......并且必须手动查找并替换所有字段名称（善良）

select * from (
    select  *,row_number() over (partition by if_id, [z.num] order by if_id, [y.num]) as rownum

    from (
            select  y.id,
                    y.dt as [y.dt], 
                    z.dt as [z.dt],    
                    y.num

            from    #temp as y 

                    outer apply (select top 1 id, dt, num

                                    from    #temp as x 

                                    where   x.id = y.id and 
                                            x.dt > y.dtand 
                                            x.num <> y.num

                                    order by x.dt asc) as z   ) as x ) as k
where rownum=1
order by [y.dt]

Answer 1

select id,min(dt) as start_date, max(dt) as end_date, num
from whatevertablename_helps_if_you_supply_these_when_asking_for_code
group by 1,4

也可以将它作为子查询来获取最小值和子查询以获得最大值，但不要认为你需要在这里做。

我的回答是Postgres ...我认为你需要将group by语句改为id，num而不是t-sql。

添加：

你怎么知道它是

1 6-1-2012 6-2-2012 2

1 6-9-2012 6-10-2012 2

而不是

1 6-1-2012 6-10-2012 2

1 6-2-2012 6-9-2012 2

您需要更多业务规则来确定

Answer 2

select id, [y.dt] as start_dt, [z.dt] as end_dt, num from (
        select  *,row_number() over (partition by id, [z.dt] order by id, [y.dt]) as rownum

        from (
                select  y.id,
                        y.dt as [y.dt], 
                        z.dt as [z.dt],    
                        y.num

                from    #temp as y 

                        outer apply (select top 1 id, dt, num

                                        from    #temp as x 

                                        where   x.id = y.id and 
                                                x.dt > y.dt and 
                                                x.num <> y.num

                                        order by x.dt asc) as z   ) as x ) as k
where rownum=1
order by id, [y.dt]

这给了我们......（用不同的数据）

id     start_dt                 end_dt                         num
6      2011-10-01 00:00:00.000  2012-01-18 00:00:00.000        896
6      2012-01-18 00:00:00.000  2012-02-01 00:00:00.000        864
6      2012-02-01 00:00:00.000  NULL                           896

我在大约一个小时之前将它发布在顶部可能......？并说这很尴尬（而且草率）......我想知道是否有人有更好的答案，因为我很糟糕。但我不明白为什么人们不断发帖说他们需要更好的业务规则，并且需要知道如何处理某些情况。这段代码正是我想要的，除了end_dt是新num的日期时间而不是当前num的最后一次出现....但我可以使用它。总比没有好。（抱歉，沮丧）。

业务规则：数据已存在。它应该显示逻辑跨度。我需要数字的start_dt和end_dt ... 当NUM = Y时，开始日期是NUM从X变为Y而结束日期是Y变为Z的时间。我可以＆＃39; t给你的不仅仅是我自己所有这些......这些规则对我来说已经足够...... ??

好的，相同的数据：

 id      start_dt   end_dt       num
 1       6-1-2012   6-3-2012    2
 1       6-3-2012   6-5-2012    4
 1       6-5-2012   6-8-2012    8
 1       6-8-2012   6-9-2012    16
 1       6-9-2012   NULL        2

SQL：将表（ID，[Datetime]，[INT]）更改为（ID，Start_DT，End_DT，[INT]）

2 个答案: