同时获得包含多个变量的基本统计信息的摘要表

时间:2020-07-06 20:43:16

标签: r statistics

我有一个包含30个变量和2000个个体的文件。但摘要将如下所示:

  Name Farm milk protein fat dim data 
  1110 PE   10.3  3,7    3.6 146 06/07/20
  3322 PE   18.3  3,3    3.9 158 06/03/20 
  1125 PE   22.3  3,4  4.6 210 12/04/20 
  1777 PE   14.3  3,0    4.2 100 28/04/20
  8970 PE   17.3  3,2    3.6 185 14/06/20

我需要:

  1. 计算所有变量的均值,中位数,最小值,最大值,四分位数,并将所有结果分组到一个唯一/单个表中,我使用了三个选项:
sapply (fenotipi2, summary) R ##print only two columns with all values ​​together
library(purrr) 
library(dplyr)

phenotype2 %>% 
    map (~ summary (.)) %>%
    rbind.data.frame 
## Error in rbind.data.frame(.): 
##       invalid list argument: all variables should have the same length
library(dplyr)
library(tidyr) 
phenotype2 %>% 
    pivot_longer(everything()) %>%
    summarize_at(vars (value), list(Min = min, Mean = mean, 
                                    Max = max, Sd = sd))  
## A tibble: 1 x 4
## Min Mean Max Sd   <dbl> <dbl> <dbl> <dbl> 1 NA NA NA NA
  1. 我还必须一步一步完成所有直方图和连续变量的分布
require(MVN)
    
result = mvn(data = fenotipi2 [-4], subset = "Species", mvnTest = "hz",
                 univariateTest = "AD", univariatePlot = "histogram",
                 multivariatePlot = "qq", multivariateOutlierMethod = "adj",
                 showOutliers = TRUE, showNewData = TRUE)

我没有得到想要的结果。

1 个答案:

答案 0 :(得分:1)

我们可以使用describe中的Hmisc,然后使用capture.output将其写入文件中

capture.output(Hmisc::describe(fenotipi2), file = "yourfile.txt")