Question

我对列表搜索时的效率差异有疑问。为什么这两者之间存在差异？

test_list= [2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50]

第一个 -

def linearSearch(A,x):
    if x in A:
        return True
    return False

第二个 -

def linearSearch_2(A,x):
    for element in A:
        if element == x:
            return True
    return False

测试它们

%timeit linearSearch(test_list, 3)
438 ns ± 5.86 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

%timeit linearSearch_2(test_list, 3)
1.28 µs ± 7.05 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

当我使用更大的列表时，差异仍然存在。这两种方法之间有什么根本区别吗？

Answer 1

虽然从理论上讲，这些应该在同一时间内完成，但Python的in运算符编写为以原始C级别工作，因此比编写自己的{{1}更快完成在Python中。

但是，如果您要将第二个代码段转换为for-loop，那么它会超出Python中的第一个代码段，因为C更低级别，因此运行速度更快。

注意：的

第一个功能几乎没用，因为它与：

相同

现在很清楚，无论何时你打电话，你都可以直接写：def linearSearch(A,x): return x in A来产生相同的结果！

出于兴趣，我在x in A写了第二个片段，但是为了让时间更加夸张，让它做了整整C次：

输出：

#include <stdio.h>
#include <time.h>

void main(){
    clock_t begin = clock();
    for (int s = 0; s < 1000000; s++){
        int x = 3;
        int a[25] = {2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50};
        for (int i = 0; i < 25; i++){
            if (i == x) break;
        }
    }
    printf("completed in %f secs\n", (double)(clock() - begin) / CLOCKS_PER_SEC);
}

而我在Python中的第一个片段的修改版本：

completed in 0.021514 secs

输出：

import time
start = time.time()

for _ in range(1000000):
    x = 3
    l = [2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50]
    if x in l:
        continue;

print("completed in", time.time() - start, "seconds")

Python中2线性线搜索的效率差异

1 个答案: