Question

我想知道一个向量在另一个向量中的起始索引。例如，对于df.rdd.coalesce(20).foreachPartitionAsync(process_partition)和c(1, 1)，它将是4。

重要的是我想寻找完全相同的向量。因此，对于c(1, 0, 0, 1, 1, 0, 1)内的c(1, 1)，它的值为c(1, 0, 1, 1, 1, 0)为假。

目前，我正在检查短向量是否包含在long中，如下所示：

c(1, 1) != c(1, 1, 1)

但是我不知道如何确定它的索引...

Answer 1

此功能应该起作用：

my_function <- function(x, find) {
  # we create two matrix from rle function
  m = matrix(unlist(rle(x)), nrow=2, byrow = T) 
  n = matrix(unlist(rle(find)), nrow=2, byrow = T)

  # for each column in m we see if its equal to n
  temp_bool = apply(m, 2, function(x) x == n) # this gives a matrix of T/F
  # then we simply sum by columns, if we have at least a 2 it means that we found (1,1) at least once
  temp_bool = apply(temp_bool, 2, sum)

  # updated part
  if (any(temp_bool==2)) {
    return(position = which(temp_bool==2)+1)
  } else {
    return(position = FALSE)
  }

}


my_function(x, find)
#[1] 4

my_function(y, find)
#[1] FALSE

为使内容更清晰，我展示了这两个apply的结果：

apply(m, 2, function(x) x == n)
#       [,1]  [,2] [,3]  [,4]  [,5]
# [1,] FALSE  TRUE TRUE FALSE FALSE
# [2,]  TRUE FALSE TRUE FALSE  TRUE  # TRUE-TRUE on column 3 is what we seek

apply(temp_bool, 2, sum)
#[1] 1 1 2 0 1

示例数据：

x <- c(1,0,0,1,1,0,1)
y <-  c(1,0,1,1,1,0)
find <- c(1,1) # as pointed this needs to be a pair of the same number

Answer 2

假设shortVec仅包含一个，而longVec仅包含零，并且一个使用rle和rep来创建与{具有相同长度的向量lens {1}}，以使每次运行中的每个元素都替换为该运行的长度。然后将其乘以longVec以将longVec中与0对应的元素清零。现在返回与等于longVec的元素对应的索引并取第一个。

length(shortVec)

Answer 3

这适用于以下示例。

students

R：在另一个向量中查找向量的索引（如果存在）

3 个答案: