Question

我在r中使用gbm来预测生存（分布=“coxph”）。

当gbm.predict（....，type =“response”）的预测值大约在[-0.001到0.5]之间。

如何在不存在0到1（[0,1]）风险的情况下解释新样本的风险。

Answer 1

如果你看一下bazehaz.gbm，你会看到gbm.predict给出lambda_0（t）。

The proportional hazard model assumes h(t|x)=lambda(t)*exp(f(x)).
gbm can estimate the f(x) component via partial likelihood.
After estimating f(x), basehaz.gbm can compute the a nonparametric estimate of lambda(t).

所以你可以这样（我希望）：

model = gbm(Surv(durata, status2015) ~ .-fold, data= ...)
XB =predict.gbm(model, n.trees = ..., type = "response")

lambda0 = basehaz.gbm(t = data$time, delta = data$censoring,  t.eval =   sort(unique(data$time)), cumulative = FALSE, f.x = XB , smooth=T)
XBnew =predict.gbm(model, n.trees = ..., data=newData, type = "response")

hazard = h(t|x)= lambda0*exp(XBnew).

如果你正在寻找生存功能......我正在研究它。：）

P.S。：在估计XB思想时有一种奇怪的行为......因为它随着树木的数量而显着变化。：（

r中的GBM用于coxph损失函数

1 个答案: