重塑数据以与geeglm()一起使用

时间:2012-09-08 01:29:12

标签: r

你能帮我弄明白我收到错误的原因吗?

最初我的数据如下所示:

> attributes(compl)$names
 [1] "UserID"         "compl_bin"      "Sex.x"          "PHQ_base"       "PHQ_Surv1"      "PHQ_Surv2"      "PHQ_Surv3"    
 [8] "PHQ_Surv4"      "EFE"            "Neuro"          "Intervention.x" "depr0"          "error1_1.x"     "error1_2.x"   
[15] "error1_3.x"     "error1_4.x"     "stress0"        "stress1"        "stress2"        "stress3"        "stress4"      
[22] "hours1"         "hours2"         "hours3"         "hours4"         "subject"       

首先,我重塑我的数据以准备geeglm:

compl$subject <- factor(rownames(compl))
nobs <- nrow(compl) 
compl_long <- reshape(compl, idvar = "subject",
                      varying = list(c("PHQ_Surv1", "PHQ_Surv2" ,
                                       "PHQ_Surv3", "PHQ_Surv4"), 
                                     c("error1_1.x", "error1_2.x",
                                       "error1_3.x", "error1_4.x"), 
                                     c("stress1", "stress2", "stress3",
                                       "stress4"), 
                                     c("hours1", "hours2", "hours3",
                                       "hours4")), 
                      v.names = c("PHQ", "error", "stress", "hours"),
                      times = c("1", "2", "3", "4"), direction = "long")

- (编者注:不确定下一个输出来自...)

 [1] "UserID"         "compl_bin"      "Sex.x"          "PHQ_base"       "EFE"            "Neuro"          "Intervention.x"
 [8] "depr0"          "stress0"        "subject"        "time"           "PHQ"            "error"          "stress"       
[15] "hours" 

然后我使用geeglm函数:

library(geepack)

geeSand=(geeglm(PHQ~as.factor(compl_bin) + Neuro+PHQ_base+as.factor(depr0) +
                    EFE+as.factor(Sex.x) + as.factor(error)+stress+hours,
                    family = poisson, data=compl_long,
                    id=subject, corst="exchangeable"))

我收到错误:

"Error in geese.fit(xx, yy, id, offset, soffset, w, waves = waves, zsca,  : 
  nrow(zsca) and length(y) not match"

如果我删除变量as.factor(错误)和小时,geeglm不会抱怨,我得到输出。该函数不适用于error和hours变量。我检查所有变量的长度,它们是相等的。你能帮我弄清楚出了什么问题吗?

非常感谢!

1 个答案:

答案 0 :(得分:1)

发现于:https://stat.ethz.ch/pipermail/r-help/2008-October/178337.html

” 我很确定这是鹅()中的一个错误,应该报告给 该 geepack的维护者。问题在于缺失的治疗方法 值。

如果看一下暗淡(na.omit(dat [,c(“id”,“score”,“chem”,“time”)]))) 得到44。 在geese.fit()中,zsca被设置为等于矩阵(1,N,1),其中N被设置为
等于 长度(ID)。但id的长度为46,而响应y为
修剪 通过消除任何数据的任何行来减去长度44 变量 涉及失踪。因此有问题。

问题的解决方案需要一些代码重写 geepack的维护者。“