Python: Plotting a graph with NetworkX and mplleaflet

Time: 2017-04-16 02:35:55

Tags: python graph leaflet networkx

I have a networkx graph created from edges such as these:

user_id,edges
11011,"[[340, 269], [269, 340]]"
80973,"[[398, 279]]"
608473,"[[69, 28]]"
2139671,"[[382, 27], [27, 285]]"
3945641,"[[120, 422], [422, 217], [217, 340], [340, 340]]"
5820642,"[[458, 442]]"

Example: [image of the graph]

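The edges are a user's movements between clusters, identified by their cluster labels, e.g. [[340, 269], [269, 340]]. This represents a user's movement from cluster 340 to cluster 269 and then back to cluster 340. The clusters have coordinates, stored in another file as latitude and longitude, for example: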

cluster_label,latitude,longitude
0,39.18193382,-77.51885109
1,39.18,-77.27
2,39.17917928,-76.6688633
3,39.1782,-77.2617
4,39.1765,-77.1927

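Is it possible to use the lat/long of the nodes/clusters to link the edges of the graph to their respective clusters in physical space, rather than in the abstract space of the graph? If so, how would I go about it? I would like to plot the graph on a map using a package such as mplleaflet (as shown here: http://htmlpreview.github.io/?https://github.com/jwass/mplleaflet/master/examples/readme_example.html) or directly in QGIS / ArcMap.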

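Edit

I am trying to convert the csv with the cluster centroid coordinates into a dictionary, but I am running into several errors, mainly "NetworkXError: Node 0 has no position" and "IndexError: too many indices for array". Below is how I am trying to convert the file to a dictionary and then plot the graph with mplleaflet.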

import csv
import networkx as nx
import pandas as pd
import matplotlib.pyplot as plt
import time
import mplleaflet


g = nx.Graph()

# Set node positions as a dictionary
df = pd.read_csv('G:\Programming Projects\GGS 681\dmv_tweets_20170309_20170314_cluster_centroids.csv', delimiter=',')
df.set_index('cluster_label', inplace=True)
dict_pos = df.to_dict(orient='index')
#print dict_pos

for row in csv.reader(open('G:\Programming Projects\GGS 681\dmv_tweets_20170309_20170314_edges.csv', 'r')):
    if '[' in row[1]:       #
        g.add_edges_from(eval(row[1]))

# Plotting with matplotlib
#nx.draw(g, with_labels=True, alpha=0.15, arrows=True, linewidths=0.01, edge_color='r', node_size=250, node_color='k')
#plt.show()

# Plotting with mplleaflet
fig, ax = plt.subplots()

nx.draw_networkx_nodes(g,pos=dict_pos,node_size=10)
nx.draw_networkx_edges(g,pos=dict_pos,edge_color='gray', alpha=.1)
nx.draw_networkx_labels(g,dict_pos, label_pos =10.3)
mplleaflet.display(fig=ax.figure)

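1 Answer:

Answer 0: (score: 2)

Yes, this is easy to do. Try something along these lines. Create a dictionary in which the nodes (cluster_label) are the keys and the longitude and latitude are stored as the values, in a list. I would use pd.read_csv() to read the csv and then df.to_dict() to create the dictionary. It should look like this: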

 dic_pos = {0: [-77.51885109, 39.18193382],
  1: [-77.27, 39.18],
  2: [-76.6688633, 39.17917928],
  3: [-77.2617, 39.1782],
  .....
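
One way to build such a dictionary from the centroid file is sketched below. It swaps in the standard library's csv.DictReader for pandas so that it runs without extra dependencies; the helper name load_positions and the inline sample rows are assumptions made for illustration. Keys are converted to int so that they match the integer node labels in the graph, and each value is ordered [longitude, latitude] because matplotlib treats positions as (x, y).

```python
import csv
import io

# Sample rows copied from the centroid file shown in the question.
SAMPLE_CSV = """cluster_label,latitude,longitude
0,39.18193382,-77.51885109
1,39.18,-77.27
2,39.17917928,-76.6688633
"""


def load_positions(csv_file):
    """Map each integer cluster label to [longitude, latitude].

    load_positions is a hypothetical helper name, not part of any library.
    """
    positions = {}
    for row in csv.DictReader(csv_file):
        label = int(row["cluster_label"])
        positions[label] = [float(row["longitude"]), float(row["latitude"])]
    return positions


dic_pos = load_positions(io.StringIO(SAMPLE_CSV))
print(dic_pos[0])
```

For a file on disk, passing open(path, newline='') in place of the io.StringIO object should work the same way.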

Plotting the graph on a map is then as simple as:

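import matplotlib.pyplot as plt
import networkx as nx
import mplleaflet

fig, ax = plt.subplots()

# GG is the graph; dic_pos maps each node to [longitude, latitude].
# edgecolors sets the node outline colour (recent NetworkX versions);
# draw_networkx_nodes takes no with_labels argument.
nx.draw_networkx_nodes(GG, pos=dic_pos, ax=ax, node_size=10, node_color='red', edgecolors='k', alpha=.5)
nx.draw_networkx_edges(GG, pos=dic_pos, ax=ax, edge_color='gray', alpha=.1)
# label_pos belongs to draw_networkx_edge_labels, so it is omitted here.
nx.draw_networkx_labels(GG, pos=dic_pos, ax=ax)

mplleaflet.display(fig=ax.figure)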

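If this does not produce the expected result, try swapping latitude and longitude. Positions are (x, y) pairs, so longitude should come first.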
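
NetworkX raises "Node ... has no position" when the graph contains a node that has no entry in the position dictionary, so it can help to check for that before drawing. The sketch below is one way to do it; the helper names parse_edges and nodes_without_position and the sample coordinates are hypothetical, and ast.literal_eval is used in place of eval as a safer way to parse the edge strings.

```python
import ast


def parse_edges(edge_field):
    """Parse a string such as "[[340, 269], [269, 340]]" into (source, target) tuples."""
    return [tuple(edge) for edge in ast.literal_eval(edge_field)]


def nodes_without_position(edges, positions):
    """Return the sorted node labels that appear in edges but not in positions."""
    nodes = {node for edge in edges for node in edge}
    return sorted(nodes - set(positions))


# One edge string from the question, plus a deliberately incomplete
# position dictionary with made-up coordinates for illustration.
edges = parse_edges("[[382, 27], [27, 285]]")
positions = {27: [-77.0, 39.0], 382: [-77.1, 38.9]}

missing = nodes_without_position(edges, positions)
print(missing)
```

Any labels reported as missing can either be added to the centroid file or dropped from the graph, for example with g.remove_nodes_from(missing), before plotting.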