我正在尝试使用cor()函数执行Pearson相关,但输出只给出1和-1,而不是系数本身。所以当我用corrplot()绘制矩阵时,我只看到那些1和-1的值。我该如何解决? 我的数据集可以找到here,并在下面查看我的脚本:
##Must load the libraries we will need! IF you have not installed the packages, do that before you start.
library("corrplot")
##Load in your datasets
D1=BPT5test
##if you don't have a Y (i.e, you want the same thing to be in both axis), leave this blank
D2=
##Run the spearman correlation. If you want to do a Pearson, change "spearman to "pearson"
##If you have 0s in your dataset, set use = "complete.obs", if you have no 0s, set use = "everything"
CorTest=cor(D1, use = "everything", method = "pearson")
##Let's get to plotting!
##Lots of changing you can do!
#Method can be "circle" "square" "pie" "color"
#ColorRampPalette can be changed, "blue" being the negative, "White" being '0', and "red" being the positive
#Change the title to whatever you want it to be
#tl.col is the color of your labels, this can be set to anything.. default is red
CorGraph=corrplot(CorTest, method = "circle", col = colorRampPalette(c("blue","white","red"))(200), title = "Pearson's Correlation of High-Fat Sugar at 8 weeks", tl.cex = .5, tl.col = "Black",diag = TRUE, cl.ratio = 0.2)
答案 0 :(得分:3)
您的数据集每个变量只包含2个观察值。仅包含两个观测值的任意两个变量之间的相关性始终为-1或1.为了自己查看,请尝试运行replicate(1e2, cor(rnorm(2), rnorm(2)))
,计算由两个观测值组成的两个变量之间的100个相关性。结果始终为-1或1。
答案 1 :(得分:2)
这是因为你只有两个观察列。
test <- data.frame(a=c(1,2),b=c(2,3),c=c(4,-2))
cor(test, use = "everything", method = "pearson")
a b c
a 1 1 -1
b 1 1 -1
c -1 -1 1
您不能指望只有两个值的不同输出,请检查Pearson correlation formula。
由于三个或更多,你会有更多变化:
test <- data.frame(a=c(1,2,3),b=c(2,3,5),c=c(4,-2,-10))
cor(test, use = "everything", method = "pearson")
a b c
a 1.0000000 0.9819805 -0.9966159
b 0.9819805 1.0000000 -0.9941916
c -0.9966159 -0.9941916 1.0000000