Question

为我的SNP计算正确的Fst值时遇到很多麻烦。尽管我的数据中没有人口，但我添加了pop作为因素，将我的数据分为4个不同的人口。一个来自河流，一个来自湖泊，另外两个则是不同群体（不同物种）。

我正在为我的基因座使用STRUCTURE文件，缺失值为-9 ...我尝试将其编码为NA，因为我认为这会引起问题，但缺失必须是整数。

MG_Sample10                         3   2   3   2   3   0   2   2   0   0   3   3   0   1   1   2   0   2   0   3   1   0   3   1   3   2   1   3   3   1   0   2   3   0   2   -9  -9  3
MG_Sample13                         3   2   3   2   3   0   -9  1   0   0   3   2   2   1   1   1   0   2   0   3   -9  -9  3   0   3   2   1   3   3   3   2   2   2   3   2   -9  -9  3   0   2   2   3   1   1   0   0   3   2   2   1   3   0   1   1

我的种群在对应于样本的单列中被编码为1-4。我已经使用了下面的代码。

library(adegenet)

library(hierfstat)

library(reshape)

file <- read.structure("file.str")

population <-read.csv("population.csv", header= F)

file@pop <- as.factor (population$V1)

hierfile <- genind2hierfstat(file)

FST <- pairwise.fst(file, pop=NULL, res.type=c("dist", "matrix"))

我的Fst值一直很低，最高的是河流人口和湖泊人口之间的0.15。输出组的值为负。我希望我所比较的两个群体的得分大约为0.2，外出群体的得分大约为0.6，因为他们不应该杂交！

如何使用hierfstat计算成对Fst？

0 个答案: