Question

我想用置信区间计算我的数据中的一组组（例如年龄，性别）的摘要统计。为此，我使用泊松分布中的蒙特卡罗模拟绘图值来获取数据中的每一行，然后折叠行以获得摘要。如果模拟的结果只是一个值（使用rclass中的返回标量），整个过程工作正常，但是一旦我尝试模拟多个结果（使用eclass中的ereturn矩阵）它就不起作用（参见下面的Stata代码））。我收到错误消息：＆＃34;在表达式中键入不匹配错误：e（A）＆＃34;。如何在没有更复杂的循环等的情况下模拟整个矢量或甚至结果矩阵？

非常感谢！佛瑞德

program bootPGW, eclass
use "C:\Users\649007\Desktop\Demetriq_PWG_edu.dta", replace
gen id=_n
sort id
gen N=_N
by id: gen    DL2=floor(rpoisson(calung))
by id:gen     D02=floor(rpoisson(D0))
by id:gen     Dsmoking=floor(rpoisson(smoking))
by id:gen     ML2=(DL2/numpyr)*1000
by id:gen     AL2=(ML2-CPSIIrate)/ML2
by id:replace AL2=0 if AL2<0
by id:gen     A02=1-exp(-PWGcoef*(ML2-CPSIIrate))
by id:gen     A2=(AL2*DL2+A02*D02)/(DL2+D02)
gen Adeaths=totdeath*A2
collapse (sum) Adeaths=Adeaths totdeath=totdeath Dsmoking=Dsmoking, by(edu_3cat sex country year)
gen AF_PWG=Adeaths/totdeath
gen AF_simple=Dsmoking/totdeath
mkmat AF_PWG, matrix(A)
ereturn matrix A=A
end

simulate a=e(A), reps(1000) nodots seed(123): bootPGW

Answer 1

关键部分是您可以使用e(b)在ereturn post中返回矩阵，在这种情况下，您可以使用_b中的快捷simulate。

除此之外我稍微清理了一下代码：你不需要按_n执行它，因为它与常规generate相同。你也不需要围绕floor()的{{1}}函数，因为它已经返回整数。您没有使用N，因此您不需要它，但即使您这样做，您也可以更好地将其存储为本地宏（或者如果您更喜欢标量）而不是变量（数据集中的列），因为它是单个数字，因此将其存储在变量中只会浪费内存。

rpoisson()

Stata中整个矩阵的Monte Carlo模拟

1 个答案: