使用循环在R中进行卡方检验

时间:2013-09-17 11:45:39

标签: r chi-squared

我是R.的新手。我找到了以下代码,用于对一组变量进行单变量逻辑回归。我想做的是运行卡方检验以获得针对因变量的变量列表,类似于下面的逻辑回归代码。我发现其中有几个涉及创建所有可能的变量组合,但我无法让它工作。理想情况下,我希望其中一个变量(X)是相同的。

Chi Square Analysis using for loop in R

lapply(c("age","sex","race","service","cancer",
         "renal","inf","cpr","sys","heart","prevad",
         "type","frac","po2","ph","pco2","bic","cre","loc"),

       function(var) {

         formula    <- as.formula(paste("status ~", var))
         res.logist <- glm(formula, data = icu, family = binomial)

         summary(res.logist)
       })

1 个答案:

答案 0 :(得分:1)

您确定lapply过的向量中的字符串是否在icu数据集的列名中?

下载icu数据时,它适用于我:

system("wget http://course1.winona.edu/bdeppa/Biostatistics/Data%20Sets/ICU.TXT")
icu <- read.table('ICU.TXT', header=TRUE)

并将status更改为STA,这是icu中的一列。这是一些变量的示例:

my.list <- lapply(c("Age","Sex","Race","Ser","Can"),         
       function(var) {
         formula    <- as.formula(paste("STA ~", var))
         res.logist <- glm(formula, data = icu, family = binomial)
         summary(res.logist)
       })

这给了我一个包含summary.glm个对象的列表。例如:

lapply(my.list, coefficients)
[[1]]
               Estimate Std. Error   z value     Pr(>|z|)
(Intercept) -3.05851323 0.69608124 -4.393903 1.113337e-05
Age          0.02754261 0.01056416  2.607174 9.129303e-03

[[2]]
              Estimate Std. Error    z value     Pr(>|z|)
(Intercept) -1.4271164  0.2273030 -6.2784758 3.419081e-10
Sex          0.1053605  0.3617088  0.2912855 7.708330e-01

[[3]]
              Estimate Std. Error    z value   Pr(>|z|)
(Intercept) -1.0500583  0.4983146 -2.1072198 0.03509853
Race        -0.2913384  0.4108026 -0.7091933 0.47820450

[[4]]
              Estimate Std. Error   z value     Pr(>|z|)
(Intercept) -0.9465961  0.2310559 -4.096827 0.0000418852
Ser         -0.9469461  0.3681954 -2.571858 0.0101154495

[[5]]
                 Estimate Std. Error       z value     Pr(>|z|)
(Intercept) -1.386294e+00  0.1863390 -7.439638e+00 1.009615e-13
Can          7.523358e-16  0.5892555  1.276756e-15 1.000000e+00

如果你想进行卡方检验:

my.list <- lapply(c("Age","Sex","Race","Ser","Can"),function(var)chisq.test(icu$STA, icu[,var]))

或所有变量组合的卡方检验:

my.list.all <- apply(combn(colnames(icu), 2), 2, function(x)chisq.test(icu[,x[1]], icu[,x[2]]))

这有用吗?