Skimr: how to remove the histogram?

Date: 2018-07-14 14:15:39

Tags: r skimr

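I want to use the function skim from the R package skimr on Windows. Unfortunately, in many situations the hist column is printed incorrectly (with many symbols like <U+2587>), as in the example below.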

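Question: is there an easy way to disable the "hist" column, either by preventing it from being printed or by not computing it at all? Is there an option like hist = FALSE?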

capture.output(skimr::skim(iris))
#>  [1] "Skim summary statistics"                                                                         
#>  [2] " n obs: 150 "                                                                                    
#>  [3] " n variables: 5 "                                                                                
#>  [4] ""                                                                                                
#>  [5] "-- Variable type:factor ------------------------------------------------------------------------"
#>  [6] " variable missing complete   n n_unique                       top_counts"                        
#>  [7] "  Species       0      150 150        3 set: 50, ver: 50, vir: 50, NA: 0"                        
#>  [8] " ordered"                                                                                        
#>  [9] "   FALSE"                                                                                        
#> [10] ""                                                                                                
#> [11] "-- Variable type:numeric -----------------------------------------------------------------------"
#> [12] "     variable missing complete   n mean   sd  p0 p25  p50 p75 p100"                              
#> [13] " Petal.Length       0      150 150 3.76 1.77 1   1.6 4.35 5.1  6.9"                              
#> [14] "  Petal.Width       0      150 150 1.2  0.76 0.1 0.3 1.3  1.8  2.5"                              
#> [15] " Sepal.Length       0      150 150 5.84 0.83 4.3 5.1 5.8  6.4  7.9"                              
#> [16] "  Sepal.Width       0      150 150 3.06 0.44 2   2.8 3    3.3  4.4"                              
#> [17] "     hist"                                                                                       
#> [18] " <U+2587><U+2581><U+2581><U+2582><U+2585><U+2585><U+2583><U+2581>"                               
#> [19] " <U+2587><U+2581><U+2581><U+2585><U+2583><U+2583><U+2582><U+2582>"                               
#> [20] " <U+2582><U+2587><U+2585><U+2587><U+2586><U+2585><U+2582><U+2582>"                               
#> [21] " <U+2581><U+2582><U+2585><U+2587><U+2583><U+2582><U+2581><U+2581>"

Changing the locale to Chinese (as suggested in this answer) does not solve the problem; it makes things worse:

Sys.setlocale(locale = "Lithuanian")
df <- data.frame(x = 1:5, y = c("Ą", "Č", "Ę", "ū", "ž"))

Sys.setlocale(locale = "Chinese")
capture.output(skimr::skim(df))

#> Error in substr(names(x), 1, options$formats$.levels$max_char) : invalid multibyte string at '<c0>'

2 answers:

Answer 0: (score: 6)

Use skim_with(numeric = list(hist = NULL)). This is described in the "Using Skimr" vignette.
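The call in this answer matches the skimr 1.x interface. As a hedged sketch for skimr 2.0 or later (an assumption about the installed version): there, skim_with() returns a new skimming function rather than changing global settings, and the per-type lists are built with sfl(). The name skim_no_hist is only illustrative.

```r
library(skimr)  # assumes skimr >= 2.0 is installed

# skim_with() builds a customised skimming function;
# setting hist = NULL inside sfl() drops the histogram for numeric columns.
skim_no_hist <- skim_with(numeric = sfl(hist = NULL))

# Use the customised function in place of skim()
result <- skim_no_hist(iris)
print(result)
```

Because the histogram is removed from the list of summary functions, it is not computed at all, not merely hidden when printing.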

Answer 1: (score: 0)

You can also use skim_without_charts instead of skim.

More details are in the documentation here:

https://www.rdocumentation.org/packages/skimr/versions/2.0.2/topics/skim
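A minimal sketch of this answer, assuming skimr 2.0 or later is installed (the version the linked documentation refers to). skim_without_charts() is called exactly like skim(), so it can be swapped into the example from the question:

```r
library(skimr)  # assumes skimr >= 2.0 is installed

# Same call as skim(iris), but without the hist column for numeric variables
result <- skim_without_charts(iris)
print(result)
```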