最短距离算法Python

时间:2011-12-02 20:54:59

标签: python breadth-first-search

我想创建一个简单的广度优先搜索算法,它返回最短路径。

演员信息词典将地图和演员映射到演员出现的电影列表中:

actor_info = { "act1" : ["movieC", "movieA"], "act2" : ["movieA", "movieB"], 
     "act3" :["movieA", "movieB"], "act4" : ["movieC", "movieD"], 
     "act5" : ["movieD", "movieB"], "act6" : ["movieE"], 
     "act7" : ["movieG", "movieE"], "act8" : ["movieD", "movieF"], 
     "KevinBacon" : ["movieF"], "act10" : ["movieG"], "act11" : ["movieG"] }

这反过来将电影映射到出现在其中的演员列表:

movie_info = {'movieB': ['act2', 'act3', 'act5'], 'movieC': ['act1', 'act4'], 
      'movieA': ['act1', 'act2', 'act3'], 'movieF': ['KevinBacon', 'act8'], 
      'movieG': ['act7', 'act10', 'act11'], 'movieD': ['act8', 'act4', 'act5'], 
      'movieE': ['act6', 'act7']}

所以打电话

shortest_dictance("act1", "Kevin Bacon", actor_info, movie_info)

我应该3 act1 movieC Act4 movieD Act8 movie F KevinBacon def shotest_distance(actA, actB, actor_info, movie_info): '''Return the number of movies required to connect actA and actB. If theres no connection return -1.''' # So we keep 2 lists of actors: # 1.The actors that we have already investigated. # 2.The actors that need to be investigated because we have found a # connection beginning at actA. This list must be # ordered, since we want to investigate actors in the order we # discover them. # -- Each time we put an actor in this list, we also store # her distance from actA. investigated = [] to_investigate = [actA] distance = 0 while actB not in to_investigate and to_investigate!= []: for actor in to_investigate: to_investigated.remove(actA) investigated.append(act) for movie in actor_info[actor]: for co_star in movie_info[movie]: if co_star not in (investigated and to_investigate): to_investigate.append(co_star) .... .... return d {{1}} {{1}} {{1}} {{1}}与{{1}}一起使用。所以最短的距离是3。

到目前为止,我有这个:

{{1}}

我无法找到合适的方法来跟踪每次迭代代码发现的距离。此外,代码似乎在时间上非常无效。

2 个答案:

答案 0 :(得分:2)

首先创建一个图形来连接所有节点然后运行shortest_path代码(可能有一个有效的图形库来执行此操作而不是下面提到的功能,然而这个是优雅的)然后找出所有从最短路径开始的电影名称数量。

for i in movie_info:
    actor_info[i] = movie_info[i]

def find_shortest_path(graph, start, end, path=[]):
    path = path + [start]
    if start == end:
        return path
    if not start in graph:
        return None
    shortest = None
    for node in graph[start]:
        if node not in path:
            newpath = find_shortest_path(graph, node, end, path)
            if newpath:
                if not shortest or len(newpath) < len(shortest):
                    shortest = newpath
    return shortest

L = find_shortest_path(actor_info, 'act1', 'act2')
print len([i for i in L if i in movie_info])

find_shortest_path来源:http://www.python.org/doc/essays/graphs/

答案 1 :(得分:1)

这看起来很有效。它会跟踪当前的一组电影。对于每一步,它都会查看所有尚未考虑过的一步电影(“看过”)。

actor_info = { "act1" : ["movieC", "movieA"], "act2" : ["movieA", "movieB"], 
     "act3" :["movieA", "movieB"], "act4" : ["movieC", "movieD"], 
     "act5" : ["movieD", "movieB"], "act6" : ["movieE"], 
     "act7" : ["movieG", "movieE"], "act8" : ["movieD", "movieF"], 
     "KevinBacon" : ["movieF"], "act10" : ["movieG"], "act11" : ["movieG"] }

movie_info = {'movieB': ['act2', 'act3', 'act5'], 'movieC': ['act1', 'act4'], 
      'movieA': ['act1', 'act2', 'act3'], 'movieF': ['KevinBacon', 'act8'], 
      'movieG': ['act7', 'act10', 'act11'], 'movieD': ['act8', 'act4', 'act5'], 
      'movieE': ['act6', 'act7']}

def shortest_distance(actA, actB, actor_info, movie_info):
    if actA not in actor_info:
        return -1  # "infinity"
    if actB not in actor_info:
        return -1  # "infinity"
    if actA == actB:
        return 0

    dist = 1
    movies = set(actor_info[actA])
    end_movies = set(actor_info[actB])
    if movies & end_movies:
        return dist

    seen = movies.copy()
    print "All movies with", actA, seen
    while 1:
        dist += 1
        next_step = set()
        for movie in movies:
            for actor in movie_info[movie]:
                next_step.update(actor_info[actor])
        print "Movies with actors from those movies", next_step
        movies = next_step - seen 
        print "New movies with actors from those movies", movies
        if not movies:
            return -1 # "Infinity"

        # Has actorB been in any of those movies?
        if movies & end_movies:
            return dist

        # Update the set of seen movies, so I don't visit them again
        seen.update(movies)

if __name__ == "__main__":
    print shortest_distance("act1", "KevinBacon", actor_info, movie_info)

输出

All movies with act1 set(['movieC', 'movieA'])
Movies with actors from those movies set(['movieB', 'movieC', 'movieA', 'movieD'])
New movies with actors from those movies set(['movieB', 'movieD'])
Movies with actors from those movies set(['movieB', 'movieC', 'movieA', 'movieF', 'movieD'])
New movies with actors from those movies set(['movieF'])
3

这是一个版本,它返回构成最小连接的电影列表(无连接时为None,如果actA和actB相同则为空列表。)

def connect(links, movie):
    chain = []
    while movie is not None:
        chain.append(movie)
        movie = links[movie]
    return chain

def shortest_distance(actA, actB, actor_info, movie_info):
    if actA not in actor_info:
        return None  # "infinity"
    if actB not in actor_info:
        return None  # "infinity"
    if actA == actB:
        return []

    # {x: y} means that x is one link outwards from y
    links = {}

    # Start from the destination and work backward
    for movie in actor_info[actB]:
        links[movie] = None
    dist = 1
    movies = links.keys()

    while 1:
        new_movies = []
        for movie in movies:
            for actor in movie_info[movie]:
                if actor == actA:
                    return connect(links, movie)
                for other_movie in actor_info[actor]:
                    if other_movie not in links:
                        links[other_movie] = movie
                        new_movies.append(other_movie)
        if not new_movies:
            return None # Infinity
        movies = new_movies

if __name__ == "__main__":
    dist = shortest_distance("act1", "KevinBacon", actor_info, movie_info)
    if dist is None:
        print "Not connected"
    else:
        print "The Kevin Bacon Number for act1 is", len(dist)
        print "Movies are:", ", ".join(dist)

这是输出:

The Kevin Bacon Number for act1 is 3
Movies are: movieC, movieD, movieF