将查询与已知列表匹配的算法,基于词典顺序呈现结果

时间:2015-07-19 02:22:56

标签: python algorithm search

我正在设计一种算法,在给定查询的列表中查找字符串,并返回列表中与查询匹配的字符串。这是基于字典顺序中的第一个匹配来回答的。如果查询是空字符串,则返回空字符串,如果查询不是已知列表中任何项目的开头,则返回-1。到目前为止,这是我的算法。有什么方法可以修改我的算法,使其运行得更快?

示例输入:

usernames: ["jBlame", "jannet"]
queries: ["j", "jm", "jbl", "JB"]

示例输出:

jannet
-1
jBlame
jBlame

这是我目前的实施。我一直绞尽脑汁试图找到提高代码速度的方法,但我找不到方法。

def name_finder(usernames, queries):
    users = sorted(usernames,key=lambda m:m.upper())
    for q in queries:
        if q=='':
            print ''
            break
        for user in users:
            if q.upper()==user.upper()[:len(q)]: 
                print user
                break
        else: print -1

1 个答案:

答案 0 :(得分:1)

执行速度 - 时间权衡。首先构建索引,这需要更多的空间和一些时间。使用该索引进行查询,每次find_name()调用都将获得O(1)时间复杂度的好处。

usernames = ["jBlame", "jannet"]
queries = ["j", "jm", "jbl", "JB"]

def build_index(usernames):
    """Build an index by given usernames

    :returns: index dict
    """
    result = {}
    # Sort should ignore cases
    for username in sorted(usernames, key=lambda x: x.lower()):
        for i in range(len(username)):
            # TODO: If you only want the first result matched, modify this line to 
            # let the index consumes less space
            result.setdefault(username[:(i + 1)].lower(), []).append(username)
    return result


def find_name(query, index):
    """return the matched username by given query and index
    """
    if not query:
        return ''
    result = index.get(query.lower())
    return result[0] if result else -1

index = build_index(usernames)
for query in queries:
    print find_name(query, index)