加速对独特观察的成对计数

时间:2016-06-07 21:01:15

标签: r performance for-loop

我有一个对象向量(object)以及相应的时间帧向量(tframe),其中观察了对象。对于每对唯一对象,我想计算观察两个对象的时间帧数

我可以使用for()循环编写代码,但随着唯一对象数量的增加,运行需要很长时间。我如何更改代码以加快运行时间?

下面是一个包含4个独特对象的示例(实际上我有大约300个)。例如,对象ac都在时间范围12中观察到,因此它们的计数为2。对象bd从未在同一时间范围内被观察到,因此它们的计数为0

object <- c("a", "a", "a", "b", "b", "c", "c", "c", "c", "d")
tframe <- c(1, 1, 2, 2, 3, 1, 2, 2, 3, 1)

uo <- unique(object)
n <- length(uo)

mpairs <- matrix(NA, nrow=n*(n-1)/2, ncol=3, dimnames=list(NULL, 
  c("obj1", "obj2", "sametf")))

row <- 0
for(i in 1:(n-1)) {
for(j in (i+1):n) {
  row <- row+1
  mpairs[row, "obj1"] <- uo[i]
  mpairs[row, "obj2"] <- uo[j]
  # no. of time frames in which both objects in a pair were observed
  intwin <- intersect(tframe[object==uo[i]], tframe[object==uo[j]])
  mpairs[row, "sametf"] <- length(intwin)
}}

data.frame(object, tframe)
   object tframe
1       a      1
2       a      1
3       a      2
4       b      2
5       b      3
6       c      1
7       c      2
8       c      2
9       c      3
10      d      1

mpairs
     obj1 obj2 sametf
[1,] "a"  "b"  "1"   
[2,] "a"  "c"  "2"   
[3,] "a"  "d"  "1"   
[4,] "b"  "c"  "2"   
[5,] "b"  "d"  "0"   
[6,] "c"  "d"  "1"   

1 个答案:

答案 0 :(得分:4)

您可以使用crossproduct来获取协议计数。然后你可以重塑 数据,如果需要的话。

实施例

object <- c("a", "a", "a", "b", "b", "c", "c", "c", "c", "d")
tframe <- c(1, 1, 2, 2, 3, 1, 2, 2, 3, 1)

# This will give you the counts
# Use code from Jean's comment
tab <- tcrossprod(table(object, tframe)>0)

# Reshape the data
tab[lower.tri(tab, TRUE)] <- NA 
reshape2::melt(tab, na.rm=TRUE)