Question

如何在不将任何值分类为“其他”的情况下显示完整的输出摘要？

summary(d)
     Date.of.Sale   City               Department      Product     
 1/18/2015 :  149   A:5290   Footwear Mens  : 538   13245  :  255  
 1/25/2015 :  149   B:2078   Home Furnishing:1937   15350  :  255  
 11/23/2014:  149   C:5088   Infant W-Wear  : 992   15352  :  255  
 11/30/2014:  149            Ladies Lower   :1735   15353  :  255  
 12/14/2014:  149            Ladies Upper   :1805   15355  :  255  
 12/21/2014:  149            Mens Lower     :2039   15356  :  255  
 (Other)   :11562            Mens Upper     :3410   (Other):10926  
            Sale      Predicted.Sale       Flag      
 0            :3963   0      :3279   Forecast: 1341  
 Not Available:1341   1      :1951   History :11115  
 1            :1145   2      : 946                   
 2            : 797   3      : 700                   
 3            : 557   4      : 572                   
 4            : 498   5      : 438                   
 (Other)      :4155   (Other):4570

Answer 1

旁白：您的数据看起来像因素列应该是数字。您可能希望看一下，因为它可能会在以后的分析中为您带来问题。

就您对summary()的调用而言，您可以调整maxsum参数。我们在help(summary)中发现，这可用于更改摘要中显示的信息量

maxsum - 整数，表示应为因子显示多少级别。

因此，让我们看一下两列数据框示例 -

set.seed(12)
df <- data.frame(
    a = sample(letters[1:8], 1e3, TRUE), 
    b = sample(letters[1:10], 1e3, TRUE)
)

在没有其他参数的情况下调用summary()，我们得到＆＃34;其他＆＃34;列在每列摘要的底部。

summary(df)
#       a             b      
# d      :132   g      :118  
# c      :131   b      :108  
# f      :131   e      :106  
# a      :123   f      :104  
# g      :123   d      :103  
# e      :122   j      :103  
# (Other):238   (Other):358

现在，如果我们将maxsum调整为所有列的唯一值的最大长度，我们将获得列出的所有值。

summary(df, maxsum = max(lengths(lapply(df, unique))))
#  a       b      
#  a:123   a: 94  
#  b:120   b:108  
#  c:131   c: 99  
#  d:132   d:103  
#  e:122   e:106  
#  f:131   f:104  
#  g:123   g:118  
#  h:118   h: 92  
#          i: 73  
#          j:103

请注意，maxsum也可能是maxsum = length(Reduce(union, df))，并且假设您正在使用数据框。

R中显示的摘要很短，许多术语显示为“其他”

1 个答案: