在R

时间:2015-09-17 16:16:14

标签: r dataframe bootstrapping nls

我有一个数据框df

dput(df)
    structure(list(x = c(49, 50, 51, 52, 53, 54, 55, 56, 1, 2, 3, 
    4, 5, 14, 15, 16, 17, 2, 3, 4, 5, 6, 10, 11, 3, 30, 64, 66, 67, 
    68, 69, 34, 35, 37, 39, 2, 17, 18, 99, 100, 102, 103, 67, 70, 
    72), y = c(2268.14043972082, 2147.62290922552, 2269.1387550775, 
    2247.31983098201, 1903.39138268307, 2174.78291538358, 2359.51909126411, 
    2488.39004804939, 212.851575751527, 461.398994384333, 567.150629704352, 
    781.775113821961, 918.303706148872, 1107.37695799186, 1160.80594193377, 
    1412.61328924168, 1689.48879626486, 260.737164468854, 306.72700499362, 
    283.410379620422, 366.813913489692, 387.570173754128, 388.602676983443, 
    477.858510450125, 128.198042456082, 535.519377609133, 1028.8780498564, 
    1098.54431357711, 1265.26965941035, 1129.58344809909, 820.922447928053, 
    749.343583476846, 779.678206156474, 646.575242339517, 733.953282899613, 
    461.156280127354, 906.813018662913, 798.186995701282, 831.365377249207, 
    764.519073183124, 672.076289062505, 669.879217186302, 1341.47673353751, 
    1401.44881976186, 1640.27575962036)), .Names = c("x", "y"), row.names = c(NA, 
    -45L), class = "data.frame")

我根据我的数据集创建了两个非线性回归(nls1和nls2)。

library(stats)
    nls1 <- nls(y~A*(x^B)*(exp(k*x)), 
                data = df, 
                start = list(A = 1000, B = 0.170, k = -0.00295))
    nls2<-nls(y~A*x^3+B*x^2+C*x+D, data=df,
        start = list(A=0.02, B=-0.6, C= 50, D=200))

然后,我计算了这两个函数的bootstrap对象,以获得多组参数(nls1的A,B和k以及nls2的A,B,C和D)。

library(nlstools)
Boo1 <- nlsBoot(nls1, niter = 200)
Boo2 <- nlsBoot(nls2, niter = 200)

基于这个引导对象,我想计算每个参数组合的r平方,并将每个引导对象的r平方值的最小值,最大值和中值保存到一个新的数据帧中。数据框可能看起来像new.df

structure(list(Median = c(NA, NA), Max = c(NA, NA), Min = c(NA, 
NA)), .Names = c("Median", "Max", "Min"), row.names = c("nls1", 
"nls2"), class = "data.frame")

然后,我们的想法是根据自举对每个非线性模型的中位数,最小值和最大值进行一些箱形图比较。有人可以帮我解决这个问题吗?提前谢谢。

1 个答案:

答案 0 :(得分:1)

来自@bunk

的回答
stat <- function(dat, inds) { fit <- try(nls(y~A*(x^B)*(exp(k*x)), data = dat[inds,], start = list(A = 1000, B = 0.170, k = -0.00295)), silent=TRUE); f1 <- if (inherits(fit, "nls")) AIC(fit) else NA; fit2 <- try(nls(y~A*x^3+B*x^2+C*x+D, data = dat[inds,], start = list(A=0.02, B=-0.6, C= 50, D=200)), silent=TRUE); f2 <- if (inherits(fit2, "nls")) AIC(fit2) else NA; c(f1, f2) }; res <- boot(df, stat, R=200). Then, to get medians for example, apply(res$t, 2, median, na.rm=TRUE)