如何使用ggbiplot使biplot名称更清晰

时间:2015-05-27 15:34:10

标签: r plot ggbiplot

我有一个可以从这里下载的数据 https://gist.github.com/anonymous/5f1135e4f750a39b0255

我尝试使用以下函数使用ggbiplot绘制PCA

add $1,$2,$3
nand $3,$2,$4
lw $2,$5,imm

然而,很难看到双标行的名称,

有没有办法让它更清晰或更好?

1 个答案:

答案 0 :(得分:1)

我认为一种更清晰的方法是使用varname.sizevarname.adjust参数调整标签的大小和位置。然而,有很多变量它仍然看起来很拥挤。通过增加箭头的长度(类似于stats::biplot()),使它看起来更好(imo)

# install ggbiplot
#require(devtools)
#install_github('ggbiplot','vqv')

library(httr) 
library(ggbiplot)

# read data
url <- "https://gist.githubusercontent.com/anonymous/5f1135e4f750a39b0255/raw/data.txt"
dat <- read.table(text=content(GET(url), as="text"), header=TRUE)

# pca 
data.pca <- prcomp (dat, center = TRUE, scale =TRUE)

# original plot + increase labels size and space from line
p <- ggbiplot(data.pca, obs.scale=1, 
              var.scale=1, circle=F, 
              varname.size=4, varname.adjust=2)  
p

enter image description here

# use coord_equal() to change size ratio of plot (excludes use of circle)
p <- p + coord_equal(1.5) + theme_classic()
p

enter image description here

要扩展箭头,需要重新计算x和y坐标。然后,您可以使用它们来编辑相关的凹凸,并更改任何其他参数(颜色,大小,旋转等)。 (你可以采用整个ggplotGrob(p)方法,但只需使用下面的grid.edit()。)

# function to rescale the x & y positions of the lines and labels
f <- function(a0, a1, M=M)
      {
      l <- lapply(as.list(environment()), as.numeric)
      out <- M* (l$a1 - l$a0) + l$a0
      grid::unit(out, "native")
      }  

# get list of grobs in current graphics window
grobs <- grid.ls(print=FALSE)  

# find segments grob for the arrows
s_id <- grobs$name[grep("segments", grobs$name)]

# edit length and colour of lines
seg <- grid.get(gPath(s_id[2]))     
grid.edit(gPath(s_id[2]),  
            x1=f(seg$x0, seg$x1, 2), 
            y1=f(seg$y0, seg$y1, 2),
            gp=gpar(col="red"))


# find text grob for the arrow labels
lab_id <- grobs$name[grep("text", grobs$name)]

# edit position of text, and rotate and colour labels
seg2 <- grid.get(gPath(lab_id)) 
grid.edit(gPath(lab_id),  
            x=f(seg$x0, seg2$x, 2), 
            y=f(seg$y0, seg2$y, 2),
            rot=0,
            gp=gpar(col="red"))

enter image description here

主观,如果这使它更好,也许只是使用biplot()或甚至定义新功能

更容易