R expss程序包:按统计数字格式/将不同格式应用于替代行

时间:2018-11-06 13:57:23

标签: r html-table formatting expss

我正在探索expss软件包,以便完全更改R的SPSS。我的标准表格在行中显示计数和百分比,有时还补充了其他统计信息。

是否可以通过统计信息或行更改数字格式?更具体地讲,我想用0位数字显示计数,用2位数字显示百分比,最好以%格式显示,用2位数字表示均值。

我在htmlTables和htmlTable.etable {expss}中进行了搜索,但找不到解决方法。

发送所有帮助

嗨,格雷戈里,

为您的利益而发送。 请参见下面的小示例。

Table as is

Table I'd like to see

Tx, micha

1 个答案:

答案 0 :(得分:1)

表是常用的data.frames,因此我们可以轻松地应用标准R格式化功能。示例:

library(expss)
data(mtcars)
mtcars = apply_labels(mtcars,
                      mpg = "Miles/(US) gallon",
                      cyl = "Number of cylinders",
                      disp = "Displacement (cu.in.)",
                      hp = "Gross horsepower",
                      drat = "Rear axle ratio",
                      wt = "Weight (1000 lbs)",
                      qsec = "1/4 mile time",
                      vs = "Engine",
                      vs = c("V-engine" = 0,
                             "Straight engine" = 1),
                      am = "Transmission",
                      am = c("Automatic" = 0,
                             "Manual"=1),
                      gear = "Number of forward gears",
                      carb = "Number of carburetors"
)


# custom formating function
custom_format = function(tbl, percent_digits = 2, count_digits = 0){
    percent_rows = grepl("\\|%$", tbl[[1]], perl = TRUE) # get rows with percent format
    count_rows = grepl("\\|N$", tbl[[1]], perl = TRUE) # get rows with count format
    # format each stat
    rounded_percent = format(tbl[percent_rows,-1], digits = percent_digits, nsmall = percent_digits) 
    rounded_count = format(tbl[count_rows,-1], digits = count_digits, nsmall = count_digits)
    # replcae data in orginal tables with formatted stat
    tbl[percent_rows,-1] = rounded_percent
    tbl[count_rows,-1] = rounded_count
    ##### remove NA which arise during formatting
    recode(tbl) = perl("^\\s*NA\\s*$") ~ ""
    tbl
}


## example
expss_output_viewer()
mtcars %>% 
    tab_cells(gear) %>% 
    tab_cols(total(), am) %>% 
    tab_stat_cases(label = "N", total_row_position = "above") %>% 
    tab_stat_cpct(label = "%", total_row_position = "none") %>% 
    tab_pivot(stat_position = "inside_rows") %>% 
    custom_format()

enter image description here