在各组中汇总SAS中的数据

时间:2013-12-22 04:40:43

标签: sas

我的数据集采用以下格式:

NEWID
Age
H_PERS
Income
OCCU
FAMTYPE
REGION
Metro(Yes/No)
Exp_alcohol
population sample-(This is the weighted population each new id represents) etc.

我想生成如下的摘要视图:

average expenditure value (This should be sum of (exp_alcohol/population sample))

% of population sample across Region Metro and each demographic variable 

请帮助我解决你的想法。

1 个答案:

答案 0 :(得分:0)

由于我看不到您的数据集并且您的描述不是很清楚,我猜测您的数据看起来像这样,并且您想要添加一些新的变量来汇总您的数据......

data alcohol;
input NEWID Age H_PERS Income OCCU $ FAMTYPE $ REGION $ Metro $ 
Exp_alcohol population_sample;
datalines;
1234 32 4 65000 abc m CA Yes 2 4
5678 23 5 35000 xyz s WA Yes 3 6
9923 34 3 49000 def d OR No 3 9
8844 26 4 54000 gdp m CA No 1 5
;
run;

data summar;
    set alcohol;
    retain TotalAvg_expend metro_count total_pop;

    Divide = exp_alcohol/population_sample;
    TotalAvg_expend + Divide; 
    total_pop + population_sample;
    if metro = 'Yes' then metro_count + population_sample;
    percent_metro = (metro_count/total_pop)*100;
    drop NEWID Age H_PERS Income OCCU FAMTYPE REGION Divide;
run;

输出:

                  Exp_     population_    TotalAvg_    metro_    total_    percent_
       Metro    alcohol       sample        expend      count      pop       metro

        Yes        2            4          0.50000        4         4       100.000
        Yes        3            6          1.00000       10        10       100.000
        No         3            9          1.33333       10        19        52.632
        No         1            5          1.53333       10        24        41.667