Question

这是一个R问题。

我有两个矩阵，＆＃34; y＆＃34;和＆＃34; l＆＃34;：

> head(y)
    SNP Category
1 29351  exclude
2 29357  exclude
3 29360  exclude
4 29372  include
5 29426  include
6 29432  include

> head(l)
  start  stop
1   246 11012
2 11494 13979
3 14309 18422
4 20728 20995
5 21457 29345
6 30035 31693

如果矩阵y中的行具有值＆＃34;包括＆＃34;在第二列中，我想检查矩阵y中第一列中的相应值是否位于＆＃34; start＆＃34;并且＆＃34;停止＆＃34;矩阵中的值l。如果矩阵y中的值确实位于矩阵l中的值之间或之间，则在矩阵y中替换值＆＃34; include＆＃34;用＆＃34;排除＆＃34;。我想我可以用嵌套for循环来做，但想知道更优雅和更快的方式。矩阵的长度不等。谢谢。

Answer 1

这很有效，但很慢。

y <- read.csv(file="SNP_pos_categorised0.99cutoff.csv", header=T)
l <- read.csv("SNPsToMoveFromINCLUDEtoEXCLUDE.csv", header=T)

colnames(y)
#[1] "SNP"      "Category"

levels(y$Category)
#[1] " exclude" " include"

colnames(l)
#[1] "start" "stop"

#start processing
for(i in 1:nrow(y))
{
    if(y[i,"Category"]==" include")
    {
        for(j in 1:nrow(l))
        {
            if(y[i,"SNP"] >= l[j,"start"] & y[i,"SNP"]<= l[j,"stop"])
            {
                y[i, "Category"] <- replace(y[i,"Category"], y[i,"Category"]==" include", " exclude" )
            }
        }
    }
}

基于另一个矩阵中的条件替换一个矩阵中的值

1 个答案: