我有500万个观测数据集,感兴趣的三个变量是州代码,县代码和道路代码。现在,我需要在将三个变量放在一起时找到不同观察的总计数。通常,人们可以通过以下方式找到特定变量的独特计数:
PROC sql;
SELECT COUNT(DISTINCT(TRACT))
AS COUNT FROM metro1_00;
quit;
但我需要像(道和县和州)一样的独特计数。但不幸的是,这些选项不可用。我正在使用SAS 9.3。
答案 0 :(得分:1)
您只需要使用group by
声明
proc sql noprint;
create table new_table as
select distinct count(tract) as COUNT_t
from metro1_00
group by tract, county, state
;
quit;
答案 1 :(得分:0)
你不能分组吗?
SELECT TRACT, COUNTY, STATE, count(*) FROM metro1_00
GROUP BY TRACT, COUNTY, STATE
如果您真的只想要TRACT,COUNTY,STATE组合的区别,您可以使用子查询/内联视图。它看起来像是supports these.
SELECT count(*)
FROM (SELECT DISTINCT TRACT, COUNTY, STATE FROM metro1_00)
as subq(TRACT, COUNTY, STATE)