stdout and stderr for parallel computing in R

Asked: 2013-04-23 17:40:19

Tags: r parallel-processing stdout stderr

I am using the parallel package for my computations. Here is a toy example:

library(parallel)
m = matrix(c(1,1,1,1,0.2,0.2,0.2,0.2), nrow=2)
myFun = function(x) {
  if (any(x<0.5)) {
    write("less than 0.5", stderr())
    return(NA)
  } else {
    write("good", stdout())
    return(mean(x))
  }
}
cl = makeCluster(2, outfile="/tmp/output")
parApply(cl, m, 2, myFun)
stopCluster(cl)

The problem is that both stdout and stderr are redirected to /tmp/output. The output file looks like this:

starting worker pid=51083 on localhost:11953 at 11:37:12.966
starting worker pid=51093 on localhost:11953 at 11:37:13.261
good
good
less than 0.5
less than 0.5

Is there a way to set up two separate files, one for stdout and one for stderr? And how can I omit the first two "starting worker pid=..." lines?

1 answer:

Answer 0 (score: 3)

The parallel package doesn't directly support sending stdout and stderr to separate files, but you can do it yourself:

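# No outfile argument, so the workers' startup output is discarded
cl = makeCluster(2)

# Runs on each worker: open both files in append mode and divert output to them
setup = function(outfile, errfile) {
  assign("outcon", file(outfile, open="a"), pos=.GlobalEnv)
  assign("errcon", file(errfile, open="a"), pos=.GlobalEnv)
  sink(outcon)                    # stdout
  sink(errcon, type="message")    # stderr
}

# Runs on each worker: restore the default sinks and close the connections
shutdown = function() {
  sink(NULL)
  sink(NULL, type="message")
  close(outcon)
  close(errcon)
  rm(outcon, errcon, pos=.GlobalEnv)
}

# m and myFun are the objects defined in the question
clusterCall(cl, setup, "/tmp/output", "/tmp/errmsg")
parApply(cl, m, 2, myFun)
clusterCall(cl, shutdown)
stopCluster(cl)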

Because the "starting worker" messages are emitted before setup is called, they are redirected to "/dev/null", which is the default behavior when outfile is not specified.
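
To check the result, the approach above can be run end to end and the two files read back. The following is only a sketch that combines the question's m and myFun with the answer's setup and shutdown. The tempfile() paths and the names outfile, errfile, res, out_lines and err_lines are illustrative assumptions, not part of the original post.

```r
library(parallel)

# Hypothetical temporary files standing in for /tmp/output and /tmp/errmsg
outfile <- tempfile("stdout_")
errfile <- tempfile("stderr_")

# Toy data and function from the question
m <- matrix(c(1, 1, 1, 1, 0.2, 0.2, 0.2, 0.2), nrow = 2)
myFun <- function(x) {
  if (any(x < 0.5)) {
    write("less than 0.5", stderr())
    return(NA)
  } else {
    write("good", stdout())
    return(mean(x))
  }
}

# Worker-side helpers from the answer
setup <- function(outfile, errfile) {
  assign("outcon", file(outfile, open = "a"), pos = .GlobalEnv)
  assign("errcon", file(errfile, open = "a"), pos = .GlobalEnv)
  sink(outcon)
  sink(errcon, type = "message")
}
shutdown <- function() {
  sink(NULL)
  sink(NULL, type = "message")
  close(outcon)
  close(errcon)
  rm(outcon, errcon, pos = .GlobalEnv)
}

cl <- makeCluster(2)
invisible(clusterCall(cl, setup, outfile, errfile))
res <- parApply(cl, m, 2, myFun)
invisible(clusterCall(cl, shutdown))
stopCluster(cl)

# Read both files back on the master to see what each stream captured
out_lines <- readLines(outfile)
err_lines <- readLines(errfile)
print(res)
print(out_lines)
print(err_lines)
```

If the redirection works as described, out_lines should hold only the "good" lines and err_lines only the "less than 0.5" lines, with no "starting worker" lines in either file.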