我有一个数据框,比如集群。我知道我们可以得到如下摘要统计数据:
summary(cluster)
给出类似的东西:
cluster client domain
Min. :4 RRr_IL:40449 P621:409
1st Qu.:4
Median :4
Mean :4
3rd Qu.:4
Max. :4
shift_end shift_length_avg patients_seen_cnt
Min. : 0.00 Min. :6.000 Min. : 1.000
1st Qu.:13.00 1st Qu.:7.000 1st Qu.: 4.000
Median :15.00 Median :8.000 Median : 6.000
Mean :13.93 Mean :8.128 Mean : 9.107
3rd Qu.:17.00 3rd Qu.:9.000 3rd Qu.: 11.000
Max. :23.00 Max. :9.000 Max. :273.000
我如何解析此摘要。这似乎是一个字符数组。
例如,如果我想达到' shift_length_avg'从总结来看,我该怎么做?
答案 0 :(得分:1)
summary
的结果是table
,这是一个字符矩阵。
summary(iris)
# Sepal.Length Sepal.Width Petal.Length Petal.Width Species
# Min. :4.300 Min. :2.000 Min. :1.000 Min. :0.100 setosa :50
# 1st Qu.:5.100 1st Qu.:2.800 1st Qu.:1.600 1st Qu.:0.300 versicolor:50
# Median :5.800 Median :3.000 Median :4.350 Median :1.300 virginica :50
# Mean :5.843 Mean :3.057 Mean :3.758 Mean :1.199
# 3rd Qu.:6.400 3rd Qu.:3.300 3rd Qu.:5.100 3rd Qu.:1.800
# Max. :7.900 Max. :4.400 Max. :6.900 Max. :2.500
str(summary(iris))
# 'table' chr [1:6, 1:5] "Min. :4.300 " "1st Qu.:5.100 " ...
# - attr(*, "dimnames")=List of 2
# ..$ : chr [1:6] "" "" "" "" ...
# ..$ : chr [1:5] " Sepal.Length" " Sepal.Width" " Petal.Length" " Petal.Width" ...
summary(iris)[, 3]
# "Min. :1.000 " "1st Qu.:1.600 " "Median :4.350 " "Mean :3.758 "
# "3rd Qu.:5.100 " "Max. :6.900 "
colnames(summary(iris))
# [1] " Sepal.Length" " Sepal.Width" " Petal.Length" " Petal.Width" " Species"
summary(iris)[, " Petal.Length"]
# "Min. :1.000 " "1st Qu.:1.600 " "Median :4.350 " "Mean :3.758 "
# "3rd Qu.:5.100 " "Max. :6.900 "
答案 1 :(得分:0)
试试这个:
Med<-median(cluster$shift_length_avg, na.rm = FALSE)