Question

我有一些代码经常用于生成大批量的条形图。条形图沿x轴按标签聚集，然后每个单独的条形图在条形图顶部都有一个标签。有时，单个条形标签的文本在绘图区域的顶部运行并被截断 - 取决于x轴标签的长度，单个条形标签的长度以及条形本身的高度。

我可以手动调整每个情节的scale_y_continuous(limits=c(0,100))以使其看起来不错。例如，如果条形高达100％但标签很长，我可能会使用：scale_y_continuous(limits=c(0,140),breaks=c(0,25,50,75,100))。这会使y轴刻度标记停止在100％，但在标签区域的顶部留下额外的空间。如果一个条形仅上升到40％并且标签很短，那么将y限制保留在c（0,140）看起来很糟糕，因为在100％标记之上有太多的空白区域。

因为我制作了大量的这些图，我想要一种自动确定如何为每个图设置scale_y_continuous的限制的方法。手动完成操作非常耗时。

有没有办法测试标签是否被截断或在绘图区域外运行？如果是这样的话，我可以通过测试看看标签是否被切断然后稍微扩展y限制直到没有截断来迭代我看起来很漂亮的情节。

Answer 1

我不认为这可以做到。我认为这归结为你没有完全理解ggplot2所基于的语法。具体来说，ggplot如何从数据值映射到审美值。

这可以通过一个简单的思想实验直观地理解：

library(ggplot2)
data(mtcars)
p <- ggplot(mtcars, aes(wt, mpg, label = rownames(mtcars)))
# consider
p + geom_text()
# vs 
p + geom_text(size= 15)

第一个地块p + geom_text()和第二个地方p + geom_text(size=15)之间有什么区别？从数据（结构）的角度来看，它们是相同的。唯一的区别是用于从数据元素映射到美学的缩放。但我的理解是没有存储从数据元素到像素元素的映射;所以你无法查看结果位置。

通过检查ggplot对象的结构，这种预感似乎很明显：

p2 <- p + geom_text()
str(p2)
List of 9
 $ data       :'data.frame':    32 obs. of  11 variables:
  ..$ mpg : num [1:32] 21 21 22.8 21.4 18.7 18.1 14.3 24.4 22.8 19.2 ...
  ..$ cyl : num [1:32] 6 6 4 6 8 6 8 4 4 6 ...
  ..$ disp: num [1:32] 160 160 108 258 360 ...
  ..$ hp  : num [1:32] 110 110 93 110 175 105 245 62 95 123 ...
  ..$ drat: num [1:32] 3.9 3.9 3.85 3.08 3.15 2.76 3.21 3.69 3.92 3.92 ...
  ..$ wt  : num [1:32] 2.62 2.88 2.32 3.21 3.44 ...
  ..$ qsec: num [1:32] 16.5 17 18.6 19.4 17 ...
  ..$ vs  : num [1:32] 0 0 1 1 0 1 0 1 1 1 ...
  ..$ am  : num [1:32] 1 1 1 0 0 0 0 0 0 0 ...
  ..$ gear: num [1:32] 4 4 4 3 3 3 3 4 4 4 ...
  ..$ carb: num [1:32] 4 4 1 1 2 1 4 2 2 4 ...
 $ layers     :List of 2
  ..$ :Classes 'proto', 'environment' <environment: 0x0000000015eb15f8> 
  ..$ :Classes 'proto', 'environment' <environment: 0x0000000015f6d0a0> 
 $ scales     :Reference class 'Scales' [package "ggplot2"] with 1 field
  ..$ scales: list()
  ..and 23 methods, of which 9 are  possibly relevant:
  ..  add, clone, find, get_scales, has_scale, initialize, input, n, non_position_scales
 $ mapping    :List of 3
  ..$ x    : symbol wt
  ..$ y    : symbol mpg
  ..$ label: language rownames(mtcars)
 $ theme      : list()
 $ coordinates:List of 1
  ..$ limits:List of 2
  .. ..$ x: NULL
  .. ..$ y: NULL
  ..- attr(*, "class")= chr [1:2] "cartesian" "coord"
 $ facet      :List of 1
  ..$ shrink: logi TRUE
  ..- attr(*, "class")= chr [1:2] "null" "facet"
 $ plot_env   :<environment: R_GlobalEnv> 
 $ labels     :List of 3
  ..$ x    : chr "wt"
  ..$ y    : chr "mpg"
  ..$ label: chr "rownames(mtcars)"
 - attr(*, "class")= chr [1:2] "gg" "ggplot"

如您所见，不存储各种几何对象的坐标像素位置（本例中为文本）。我猜这是因为ggplot2意味着交互使用，但这只是猜测。也就是说，您可以轻松更改绘图。保持情节的唯一原因是通过ggsave或grDevices函数之一（png等），在这种情况下，图形将保存到文件中。

如果您查看Hadley关于ggplot的书中的图3.7，这是图形生成过程的示意图，似乎不会保存从数据元素到像素位置的映射存储，而是每次生成

...如果对ggplot内部工作有更深入了解/已经检查过源代码的人有不同的意见，请加入，我会更新或删除我的答案。

如何检测标签是否在ggplot2图中的绘图区域运行？

1 个答案: