我有以下几行:
CREATE TABLE #TEMP (id int, name varchar(255), startdate datetime, enddate datetime)
INSERT INTO #TEMP VALUES(1, 'John', '2011-01-11 00:00:00.000','2011-01-11 00:01:10.000')
INSERT INTO #TEMP VALUES(2, 'John', '2011-01-11 00:00:20.000','2011-01-11 00:01:05.000')
INSERT INTO #TEMP VALUES(3, 'John', '2011-01-11 00:01:40.000','2011-01-11 00:01:50.000')
INSERT INTO #TEMP VALUES(4, 'Adam', '2011-01-11 00:00:40.000','2011-01-11 00:01:20.000')
INSERT INTO #TEMP VALUES(5, 'Adam', '2011-01-11 00:00:45.000','2011-01-11 00:01:15.000')
SELECT * FROM #TEMP
DROP TABLE #TEMP
我正在尝试删除其他日期中包含日期的记录,以获取以下内容:
John 2011-01-11 00:00:00.000 2011-01-11 00:01:10.000
John 2011-01-11 00:01:40.000 2011-01-11 00:01:50.000
Adam 2011-01-11 00:00:40.000 2011-01-11 00:01:20.000
关于如何为大约100K行的表实现此目的的任何建议?
答案 0 :(得分:2)
这给出了期望的结果:
DELETE T1 FROM #TEMP T1
WHERE EXISTS(
SELECT NULL FROM #TEMP T2
WHERE t1.id <> t2.id
AND t1.name = t2.name
AND t1.startdate >= t1.startdate
AND t1.enddate <= t1.enddate
)
http://msdn.microsoft.com/en-us/library/ms188336.aspx
编辑:我刚注意到有一个问题。如果存在重复(相同的开始和结束),则两者都将被删除(没有约翰的方法,即使只有一个相同的日期)。所以你需要考虑到这一点:
DELETE T1 FROM #TEMP T1
WHERE EXISTS(
SELECT NULL FROM #TEMP T2
WHERE t1.id <> t2.id
AND t1.name = t2.name
AND t1.startdate > t2.startdate
AND t1.enddate < t2.enddate
OR t1.id <> t2.id
AND t1.name = t2.name
AND t1.startdate = t2.startdate
AND t1.enddate < t2.enddate
OR t1.id <> t2.id
AND t1.name = t2.name
AND t1.startdate > t2.startdate
AND t1.enddate = t2.enddate
OR t1.id > t2.id
AND t1.name = t2.name
AND t1.startdate = t2.startdate
AND t1.enddate = t2.enddate
)
答案 1 :(得分:2)
DELETE t1 FROM #TEMP t1
INNER JOIN #TEMP t2 ON t2.startdate < t1.startdate AND t1.enddate < t2.enddate
AND t1.name = t2.name
结果匹配