Question

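I have two data frames, "a" and "b". Both contain GPS data, but "a" has 1000 rows and "b" has 5 rows. I am computing distances with the haversine formula, and I want to apply the function so that every row of "a" is compared with every row of "b". I should get 5000 results.

This is what I have so far, but it only gives me 1000 results: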

library(geosphere)

for(i in 1:nrow(a)){
  distHaversine(a[,c(11,9)],b[,c(4,2)])
}

Thanks in advance for any help.

Edit

I found a better solution that needs less code and less computation time:

library(geosphere)

result <- distm(a[ , c(11, 9)], b[ , c(4, 2)], fun = distHaversine)
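As a minimal sketch of what distm returns, the snippet below uses two small made-up data frames. The names pts_a and pts_b and their coordinates are illustrative and are not the asker's data. It assumes the columns passed in are in longitude, latitude order, which is what geosphere expects.

```r
library(geosphere)

# Hypothetical stand-ins for "a" and "b": columns are longitude, latitude
pts_a <- data.frame(lon = c(0, 10, 20), lat = c(0, 10, 20))
pts_b <- data.frame(lon = c(5, 15), lat = c(5, 15))

# One row per point in pts_a, one column per point in pts_b
d <- distm(pts_a[, c("lon", "lat")], pts_b[, c("lon", "lat")],
           fun = distHaversine)

dim(d)     # nrow(pts_a) by nrow(pts_b)
length(d)  # nrow(pts_a) * nrow(pts_b) distances in total
```

Entry d[i, j] is the distance in metres between row i of pts_a and row j of pts_b, so with 1000 and 5 rows the matrix would hold the 5000 values the question asks for.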

Answer 1

Perhaps something like the following.

result <- matrix(numeric(nrow(a)*nrow(b)), ncol = nrow(b))

for(i in seq_len(nrow(a))){
    for(j in seq_len(nrow(b))){
        result[i, j] <- distHaversine(a[i, c(11, 9)],b[j, c(4, 2)])
    }
}

result

Answer 2

This might be a solution for you:

indx <- expand.grid(a=1:1000,b=1:5)

# pass only the longitude/latitude columns from the question to distHaversine
res <- apply(indx, 1, function(x) distHaversine(a[x[1], c(11, 9)], b[x[2], c(4, 2)]))

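With expand.grid I build every combination of row indices from the two data.frames, then use those indices inside the apply call.

To trace each computed distance back to its pair of rows, you can bind the result as a column next to the indices.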

> head(cbind(indx,res))
  a b      res
1 1 1 12318145
2 2 1  5528108
3 3 1 11090739
4 4 1 14962267
5 5 1 19480911
6 6 1  8936878
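The same index/distance table can also be built from a distm matrix, as a rough sketch. The data frames pts_a and pts_b below are made-up examples with longitude, latitude columns, not the data from the question. The approach relies on expand.grid varying its first column fastest, which matches the column-by-column order in which as.vector reads a matrix.

```r
library(geosphere)

# Hypothetical example data: columns are longitude, latitude
pts_a <- data.frame(lon = c(0, 10, 20), lat = c(0, 10, 20))
pts_b <- data.frame(lon = c(5, 15), lat = c(5, 15))

# All index pairs; the first column (a) varies fastest
indx <- expand.grid(a = seq_len(nrow(pts_a)), b = seq_len(nrow(pts_b)))

# distm fills an nrow(pts_a) x nrow(pts_b) matrix; as.vector() reads it
# column by column, which is the same order as the rows of indx
res <- as.vector(distm(pts_a, pts_b, fun = distHaversine))

long <- cbind(indx, res)
head(long)
```

This avoids calling distHaversine once per pair inside apply, which may matter when the data frames are large.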

Apply a function to each row of one data frame against every row of another data frame
