我正在从事主成分分析(PCA)。
我发现ggfortify
的效果不错,但想进行一些手动调整。
然后在此处尝试绘制PCA结果,如下所示:
evec <- read.table(textConnection("
PC1 PC2 PC3
-0.5708394 -0.6158420 -0.5430295
-0.6210178 -0.1087985 0.7762086
-0.5371026 0.7803214 -0.3203424"
), header = TRUE, row.names = c("M1", "M2", "M3"))
res.ct <- read.table(textConnection("
PC1 PC2 PC3
-1.762697 -1.3404825 -0.3098503
-2.349978 -0.0531175 0.6890453
-1.074205 1.5606429 -0.6406848
2.887080 -0.7272039 -0.3687029
2.299799 0.5601610 0.6301927"
), header = TRUE, row.names = c("A", "B", "C", "D", "E"))
require(ggplot2)
require(dplyr)
gpobj <-
res.ct %>%
ggplot(mapping = aes(x=PC1, y=PC2)) +
geom_point(color="grey30") +
annotate(geom="text", x=res.ct$PC1*1.07, y=res.ct$PC2*1.07,
label=rownames(res.ct))
for (i in 1:nrow(evec))
{
PCx <- evec[i,1]
PCy <- evec[i,2]
axisname <- rownames(evec)[[i]]
gpobj <- gpobj +
geom_segment(
data = evec[i,],
aes(
x = 0, y = 0,
xend = PC1, yend = PC2
# xend = PCx, yend = PCy #not work as intended
),
arrow = arrow(length = unit(4, "mm")),
color = "red"
) +
annotate(
geom = "text",
x = PCx * 1.15, y = PCy * 1.15,
label = axisname,
color = "red"
)
}
gpobj
该代码运行良好,但是当我尝试使用带注释的行xend = PCx, yend = PCy
而不是xend = PC1, yend = PC2
时,它不能按我预期的那样很好地工作,它不会显示所有箭头。
xend = PC1, yend = PC2
运作良好:
xend = PCx, yend = PCy
不:
问题:
当起点和终点由环境变量指定而不是由geom_segment()
的变量名称引用时,为什么data =
不保留上一个箭头?
答案 0 :(得分:3)
在您使用的代码中,当在美学映射PCx
中指定了PCy
/ aes(...)
时(与将其硬编码为aes(...)
之外的固定美学值相反,就像对annotate
层所做的那样),仅在绘制/打印ggplot对象gpobj
时才评估实际值。
这意味着PCx
/ PCy
的值在for循环的外部中求值。至此,它们对应于i = 3
所采用的最后一个值,这就是为什么只有一个箭头段(实际上是三个箭头叠置)可见的原因。将xend = PCx, yend = PCy
移到aes(...)
之外应该会达到您想要的外观。
我确实想知道为什么您首先选择使用for循环。这样的东西不会达到相同的目的吗?
# convert row names to explicit columns
res.ct <- tibble::rownames_to_column(res.ct)
evec <- tibble::rownames_to_column(evec)
# plot
res.ct %>%
ggplot(mapping = aes(x=PC1, y=PC2)) +
geom_point(color="grey30") +
geom_text(aes(x = PC1 * 1.07, y = PC2 * 1.07,
label = rowname)) +
geom_segment(data = evec,
aes(x = 0, y = 0, xend = PC1, yend = PC2, group = rowname),
arrow = arrow(length = unit(4, "mm")),
color = "red") +
geom_text(data = evec,
aes(x = PC1 * 1.15, y = PC2 * 1.15, label = rowname),
colour = "red")