ggplot: geom_segment() with multiple arrows

Date: 2019-12-14 00:20:57

Tags: r ggplot2

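I am working on a principal component analysis (PCA). I found that ggfortify works well, but I would like to make some manual adjustments.

I then tried to plot the PCA results as follows: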

evec <- read.table(textConnection("
  PC1        PC2        PC3
  -0.5708394 -0.6158420 -0.5430295
  -0.6210178 -0.1087985  0.7762086
  -0.5371026  0.7803214 -0.3203424"
), header = TRUE, row.names = c("M1", "M2", "M3"))

res.ct <- read.table(textConnection("
  PC1        PC2        PC3
  -1.762697 -1.3404825 -0.3098503
  -2.349978 -0.0531175  0.6890453
  -1.074205  1.5606429 -0.6406848
  2.887080 -0.7272039 -0.3687029
  2.299799  0.5601610  0.6301927"
), header = TRUE, row.names = c("A", "B", "C", "D", "E"))

require(ggplot2)
require(dplyr)
gpobj <- 
  res.ct %>%
  ggplot(mapping = aes(x=PC1, y=PC2)) +
  geom_point(color="grey30") +
  annotate(geom="text", x=res.ct$PC1*1.07, y=res.ct$PC2*1.07,
           label=rownames(res.ct))

for (i in 1:nrow(evec))
{
  PCx <- evec[i,1]
  PCy <- evec[i,2]
  axisname <- rownames(evec)[[i]]
  gpobj <- gpobj +
    geom_segment(
      data = evec[i,],
      aes(
        x = 0, y = 0,
        xend = PC1, yend = PC2
        # xend = PCx, yend = PCy  # does not work as intended
      ),
      arrow = arrow(length = unit(4, "mm")),
      color = "red"
    ) +
    annotate(
      geom = "text",
      x = PCx * 1.15, y = PCy * 1.15,
      label = axisname,
      color = "red"
    )
}
gpobj

This code works well. However, when I use the commented-out line xend = PCx, yend = PCy instead of xend = PC1, yend = PC2, it does not work as I expected: not all of the arrows are shown.

xend = PC1, yend = PC2 works well:

[Figure: plot produced with xend = PC1, yend = PC2]

xend = PCx, yend = PCy does not:

[Figure: plot produced with xend = PCx, yend = PCy]

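Question: Why does geom_segment() not keep the previous arrows when the start and end points are given by variables from the environment, rather than by column names that refer to data =?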

1 answer:

Answer 0: (score: 3)

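In your code, when PCx / PCy are specified inside the aesthetic mapping aes(...), their actual values are only evaluated when the ggplot object gpobj is plotted / printed. This is in contrast to hard-coding them as fixed aesthetic values outside aes(...), as you did for the annotate layer.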
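To make the delayed evaluation concrete, here is a minimal sketch. It assumes ggplot2 3.0.0 or later, where aes() stores each mapping as a quosure, and it uses rlang::eval_tidy() purely for illustration; the variable name `mapping` is made up for this example.

```r
library(ggplot2)

PCx <- 1
# aes() records the expression `PCx`, not its current value
mapping <- aes(xend = PCx)

# Change the variable after the mapping has been created
PCx <- 2

# The stored expression is still just the symbol PCx
stored_expr <- rlang::quo_get_expr(mapping$xend)

# Evaluating the mapping now picks up whatever PCx holds at this point
value_now <- rlang::eval_tidy(mapping$xend)
```

Here `value_now` reflects the value assigned after aes() was called, which is the same effect that makes every layer in the loop see the PCx / PCy of the final iteration.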

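This means that the values of PCx / PCy are evaluated outside the for loop. By that point, they hold the last values they took, at i = 3, which is why only one arrow segment is visible (it is actually three arrows drawn on top of each other). Moving xend = PCx, yend = PCy outside aes(...) should give you the look you want.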
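A sketch of that change, reduced to the arrow layers only: xend and yend are passed as fixed parameters outside aes(...), so they are evaluated inside each loop iteration. The data frame below re-enters the first two columns of evec from the question; starting from an empty ggplot() is a simplification for this example.

```r
library(ggplot2)

# First two principal components of evec from the question
evec <- data.frame(
  PC1 = c(-0.5708394, -0.6210178, -0.5371026),
  PC2 = c(-0.6158420, -0.1087985,  0.7803214),
  row.names = c("M1", "M2", "M3")
)

gpobj <- ggplot()
for (i in seq_len(nrow(evec))) {
  PCx <- evec[i, 1]
  PCy <- evec[i, 2]
  gpobj <- gpobj +
    geom_segment(
      data = evec[i, ],
      aes(x = 0, y = 0),
      # fixed values outside aes(): evaluated now, in this iteration
      xend = PCx, yend = PCy,
      arrow = arrow(length = unit(4, "mm")),
      color = "red"
    )
}
gpobj
```

If you would rather keep the values inside aes(...), unquoting them with `!!` (for example aes(xend = !!PCx, yend = !!PCy)) should also inline the current values, assuming a ggplot2 version whose aes() supports quasiquotation.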

I do wonder why you chose to use a for loop in the first place. Wouldn't something like this serve the same purpose?

# convert row names to explicit columns
res.ct <- tibble::rownames_to_column(res.ct)
evec <- tibble::rownames_to_column(evec)

# plot
res.ct %>%
  ggplot(mapping = aes(x=PC1, y=PC2)) +
  geom_point(color="grey30") +
  geom_text(aes(x = PC1 * 1.07, y = PC2 * 1.07,
                label = rowname)) +
  geom_segment(data = evec,
               aes(x = 0, y = 0, xend = PC1, yend = PC2, group = rowname),
               arrow = arrow(length = unit(4, "mm")),
               color = "red") +
  geom_text(data = evec,
            aes(x = PC1 * 1.15, y = PC2 * 1.15, label = rowname),
            colour = "red")

[Figure: resulting plot]