R数值到char精度损失

时间:2018-11-13 11:03:03

标签: r type-conversion precision

我想将多位数字矢量转换为字符。我尝试了以下解决方案here,该解决方案只能用于一个数字,而不能用于矢量。没关系

options(digits=20)
options(scipen=99999)
x<-129483.19999999999709;format(round(x, 12), nsmall = 12)
[1] "129483.199999999997"

但这不是。如何保持数字矢量字符的数字精度?

> y <- c(129483.19999999999709, 1.3546746874,687676846.2546746464)

687676846.2546746464特别有问题,也尝试过:

> specify_decimal(y, 12)
[1] "129483.199999999997"    "1.354674687400"         "687676846.254674673080"
> formatC(y, digits = 12, format = "f")
[1] "129483.199999999997"    "1.354674687400"         "687676846.254674673080"
> formattable(y, digits = 12, format = "f")
[1] 129483.199999999997    1.354674687400         687676846.254674673080
> sprintf(y, fmt='%#.12g')
[1] "129483.200000" "1.35467468740" "687676846.255"
> sprintf(y, fmt='%#.22g')
[1] "129483.1999999999970896" "1.354674687399999966075" "687676846.2546746730804"

预期结果:

[1] "129483.199999999997" "1.354674687400" "687676846.254674646400"

精度损失似乎仅发生一次,不再重复。

> require(dplyr)
> convert <- function(x) as.numeric(as.character(x))
> 687676846.2546746464 %>% convert
[1] 687676846.25467503
> 687676846.2546746464 %>% convert %>% convert %>% convert
[1] 687676846.25467503

这里我只有5位数的精度,但是更麻烦的是我事先不知道要达到什么精度。

1 个答案:

答案 0 :(得分:0)

最后,我可以使用这些功能来完成我想做的事情。 addtrailingzeroes将在十进制后的x处添加多个零。

nbdec <- function(x) {
  x1 <- as.character(x)
  xsplit <- strsplit(x1,"\\.")
  xlength <- sapply(xsplit, function(d) nchar(d)[2])
  xlength <- ifelse(is.na(xlength), 0, xlength)
  return(xlength)
}

trailingzeroes <- function(x, dig) {
  res <- rep(NA, length(x))
  for( i in 1:length(x)) {
    if(!is.na(x[i])) res[i] <- { paste0(rep(0,max(0,dig-nbdec(x[i]))), collapse="") }
    else { res[i] <- ""}
    }
return(res)
}

trailingcommas <- function(x) ifelse(is.na(x), NA, ifelse(nbdec(x)==0, ".",""))

addtrailingzeroes <- function(x, digits) {
  return(ifelse(!is.na(x), paste0(x, trailingcommas(x), trailingzeroes(x, digits)),NA))
}

但是,要抑制不准确和舍入错误,必须先使用roundnumerics.max裁剪x:

roundnumerics.max <- function(df, startdig=12) {
  for(icol in 1:ncol(df)) {
    if( is.numeric(df[,icol])) {
      dig <- startdig
      while(any(!as.numeric(as.character(df[,icol])) %==% df[,icol])) {
        dig <- dig-1
        df[,icol] <- round(df[,icol], digits=dig)
        if(dig==0) {
          break
          pprint("ERROR: zero numeric accuracy")
        }
      } 
      pprint("Numeric accuracy for column ",icol," ", colnames(df)[icol], " is ", dig)
    }
  }
  return(data.frame(df, stringsAsFactors = F))
}

这是缓慢的过程,远非优雅……我仍然认为很难相信R对16个有效数字有这样的精度限制,并且添加了不准确的噪声,当您尝试增加digits选项... 不显示,让您知道...