SQL Server中的Over子句

时间:2013-01-04 19:27:17

标签: sql sql-server sql-server-2008 tsql analytic-functions

我有以下查询

select * from 
(
        SELECT distinct 
        rx.patid
       ,rx.fillDate
       ,rx.scriptEndDate
       ,MAX(datediff(day, rx.filldate, rx.scriptenddate)) AS longestScript
       ,rx.drugClass
       ,COUNT(rx.drugName) over(partition by rx.patid,rx.fillDate,rx.drugclass) as distinctFamilies
       FROM [I 3 SCI control].dbo.rx
       where rx.drugClass in ('h3a','h6h','h4b','h2f','h2s','j7c','h2e')
       GROUP BY rx.patid, rx.fillDate, rx.scriptEndDate,rx.drugName,rx.drugClass

) r
order by distinctFamilies desc

产生看起来像的结果 enter image description here

这应该意味着在表格中的两个日期之间应该有5个独特的药物名称。但是,当我运行以下查询时:

select distinct *
    from rx 
    where patid = 1358801781 and fillDate between '2008-10-17' and '2008-11-16' and drugClass='H4B'

我返回的结果集看起来像

enter image description here

您可以看到,虽然在2008-10-17和2009-01-15之间的第二个查询实际上返回了五行,但只有三个唯一的名称。我已经尝试了各种修改over子句的方法,所有方法都有不同程度的不成功。如何更改查询,以便在每行指定的时间范围内找到唯一 drugName

1 个答案:

答案 0 :(得分:3)

拍摄它:

   SELECT DISTINCT
  patid, 
  fillDate, 
  scriptEndDate, 
  MAX(DATEDIFF(day, fillDate, scriptEndDate)) AS longestScript,
  drugClass,
  MAX(rn) OVER(PARTITION BY patid, fillDate, drugClass) as distinctFamilies
FROM (
  SELECT patid, fillDate, scriptEndDate, drugClass,rx.drugName,
  DENSE_RANK() OVER(PARTITION BY patid, fillDate, drugClass ORDER BY drugName) as rn
  FROM [I 3 SCI control].dbo.rx
  WHERE drugClass IN ('h3a','h6h','h4b','h2f','h2s','j7c','h2e')
)x
GROUP BY x.patid, x.fillDate, x.scriptEndDate,x.drugName,x.drugClass,x.rn
ORDER BY distinctFamilies DESC

不确定DISTINCT是否真的有必要 - 因为你已经使用过它了。