Removing duplicate rows that fall within 7 days of each other

Time: 2019-01-03 11:00:55

Tags: sql-server tsql

Below are my table and sample data:

DECLARE @Employee_Log table(ID int, eid int, ecode varchar(100), emp_startdate date)

INSERT INTO @Employee_Log
SELECT 1, 1, 'aaa','2019-01-01'
UNION ALL
SELECT 2, 1, 'aaa','2019-01-05'
UNION ALL
SELECT 3, 1, 'bbb','2019-01-03'
UNION ALL
SELECT 4, 2, 'aaa','2019-01-03'
UNION ALL
SELECT 5, 1, 'aaa','2019-02-01'
UNION ALL
SELECT 6, 1, 'aaa','2019-02-15'
UNION ALL
SELECT 7, 1, 'aaa','2019-02-19'
UNION ALL
SELECT 8, 1, 'aaa','2019-02-28'

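In the data above, I want to remove duplicates based on eid and ecode. If the emp_startdate values fall within 7 days of each other, keep the latest row and ignore the rest.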

I tried the code below, but how do I add a condition that checks the one-week range?

SELECT 
ROW_NUMBER() OVER(PARTITION BY eid,ecode ORDER BY emp_startdate desc) as rownum,
ID,eid,ecode,emp_startdate
FROM @Employee_Log

I want a result like the following:

ID     eid     ecode     emp_startdate
2       1       aaa       2019-01-05
5       1       aaa       2019-02-01
4       2       aaa       2019-01-03
7       1       aaa       2019-02-19
8       1       aaa       2019-02-28
3       1       bbb       2019-01-03

2 answers:

Answer 0 (score: 1)

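I'm still not sure what you want to do if more than 2 events occur within the same 7 days. However, this solution returns the latest date in every series of dates where the gap between consecutive dates is 7 days or less.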

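select ID, eid, ecode, emp_startdate
from
(
    select ID,
           eid,
           ecode,
           emp_startdate,
           -- days until the next row of the same eid/ecode; NULL for the last row
           datediff(day,
                    emp_startdate,
                    lead(emp_startdate)
                        over (partition by eid, ecode order by emp_startdate)) as days_to_next
    from @Employee_Log
) a
-- keep a row only if no later row follows within 7 days
where days_to_next is null or days_to_next > 7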

ID     eid     ecode     emp_startdate
--     ---     -----     -------------
3       1       bbb       2019-01-03
2       1       aaa       2019-01-05
5       1       aaa       2019-02-01
7       1       aaa       2019-02-19
8       1       aaa       2019-02-28
4       2       aaa       2019-01-03
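
Since the question asks to remove the duplicates rather than only filter them out, the same LEAD-based gap test could be turned into a DELETE. The following is a minimal sketch, not part of the original answer: it assumes SQL Server 2012 or later (for LEAD), rebuilds the question's sample data so it runs on its own, and uses a CTE named gaps and a column named days_to_next, both of which are illustrative names.

```sql
-- Sample data from the question, rebuilt so the sketch runs on its own.
DECLARE @Employee_Log table (ID int, eid int, ecode varchar(100), emp_startdate date);

INSERT INTO @Employee_Log (ID, eid, ecode, emp_startdate)
VALUES (1, 1, 'aaa', '2019-01-01'),
       (2, 1, 'aaa', '2019-01-05'),
       (3, 1, 'bbb', '2019-01-03'),
       (4, 2, 'aaa', '2019-01-03'),
       (5, 1, 'aaa', '2019-02-01'),
       (6, 1, 'aaa', '2019-02-15'),
       (7, 1, 'aaa', '2019-02-19'),
       (8, 1, 'aaa', '2019-02-28');

-- For each row, compute the gap in days to the next row of the same eid/ecode.
-- "gaps" and "days_to_next" are illustrative names (assumptions).
WITH gaps AS
(
    SELECT ID,
           DATEDIFF(day, emp_startdate,
                    LEAD(emp_startdate) OVER (PARTITION BY eid, ecode
                                              ORDER BY emp_startdate)) AS days_to_next
    FROM @Employee_Log
)
-- Deleting through the CTE removes the matching rows from @Employee_Log.
-- Rows with a NULL gap (the last row of each group) are never deleted.
DELETE FROM gaps
WHERE days_to_next <= 7;

-- Show what is left.
SELECT ID, eid, ecode, emp_startdate
FROM @Employee_Log
ORDER BY ID;
```

On the sample data this removes IDs 1 and 6, leaving the six rows shown above. Note that the test compares each row only with its immediate successor, so a long chain of rows spaced a few days apart would collapse to its last row even if the chain spans more than 7 days in total.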

Answer 1 (score: 0)

The following query gives you what you asked for in plain English, but your sample data and desired output contradict your own question:

SELECT *
FROM
(
    SELECT
        ROW_NUMBER() OVER (PARTITION BY eid , ecode , YEAR(emp_startdate) 
                                       , DATEPART(WEEK, emp_startdate)
                           ORDER BY emp_startdate DESC
                          ) AS rownum
      , ID
      , eid
      , ecode
      , emp_startdate
    FROM @Employee_Log
) x
WHERE x.rownum = 1;
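
To see why bucketing by calendar week differs from a 7-day gap, it may help to look at two dates from the sample data. This is a small sketch, not part of the original answer; the variable names @d1 and @d2 are illustrative, and the week numbers depend on the session's SET DATEFIRST setting.

```sql
-- Two emp_startdate values from the question (IDs 6 and 7).
-- @d1 and @d2 are illustrative names (assumptions).
DECLARE @d1 date = '2019-02-15',  -- a Friday
        @d2 date = '2019-02-19';  -- the following Tuesday

SELECT DATEDIFF(day, @d1, @d2) AS days_apart,  -- 4 days, so "within 7 days"
       DATEPART(WEEK, @d1)     AS week_of_d1,  -- depends on SET DATEFIRST
       DATEPART(WEEK, @d2)     AS week_of_d2;  -- depends on SET DATEFIRST
```

With the us_english default of SET DATEFIRST 7 (weeks starting on Sunday), the two dates land in weeks 7 and 8, so the query above places them in different partitions and returns both ID 6 and ID 7, while the desired output in the question keeps only ID 7.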