获取命令重新创建比dput()更简单的对象

时间:2014-07-28 07:41:14

标签: r reproducible-research

dput的输出通常比用户输入创建相同对象的内容复杂得多。我知道这可能是保证100%再现性所必需的(例如,当不同的用户使用不同的默认设置时)。但是,它并没有使示例尽可能可读,而且我经常花一些时间来简化输出。

作为示例,请考虑:

dput(data.frame(a=1:10))
> structure(list(a = 1:10), .Names = "a", row.names = c(NA, -10L), class = "data.frame")

是否还有dput的替代方案只会返回data.frame(a=1:10)

1 个答案:

答案 0 :(得分:3)

以下是data.frames的部分实现(没有row.names或其他属性):

dput2 <- function(x, ...) UseMethod("dput2")
dput2.data.frame <- function(x, ...) {
    fun <- function(nm) paste(nm, "=", 
            paste(capture.output(dput(x[[nm]], file = stdout())), collapse = ""))
    L <- lapply(names(x), fun)
    cat(paste("data.frame(", paste(unlist(L), collapse = ",\n"), ")"), "\n")
}

例如,

> dput2(BOD)
data.frame( Time = c(1, 2, 3, 4, 5, 7),
demand = c(8.3, 10.3, 19, 16, 15.6, 19.8) ) 

> dput2(anscombe)
data.frame( x1 = c(10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5),
x2 = c(10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5),
x3 = c(10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5),
x4 = c(8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8),
y1 = c(8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68),
y2 = c(9.14, 8.14, 8.74, 8.77, 9.26, 8.1, 6.13, 3.1, 9.13, 7.26, 4.74),
y3 = c(7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73),
y4 = c(6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.5, 5.56, 7.91, 6.89) )