由于numpy.float64值,将NetworkX图导出为graphml会引发异常

时间:2017-12-19 19:10:59

标签: python numpy networkx graphml

我正在使用以下代码分析加权化学反应网络:

import networkx as nx
import pandas as pd

edge_data = pd.read_table('VULCAN 800.dat', sep=',')
edge_list=[]
edge_data = edge_data.fillna(value=0.1)
col1=edge_data['Column_1']
col2=edge_data['Column_2']
col3=edge_data['Column_3']
col4=edge_data['Column_4']
col5=edge_data['Column_5']
col6=edge_data['Column_6']
for i in range(1,560):
    edge_list.append((col1.iloc[i],col2.iloc[i],col5.iloc[i]))
if pd.isnull(col3.iloc[i]) != True:
    edge_list.append((col1.iloc[i],col3.iloc[i],col5.iloc[i]))
    edge_list.append((col2.iloc[i],col3.iloc[i],col5.iloc[i]))
if pd.isnull(col4.iloc[i]) != True:
    edge_list.append((col1.iloc[i],col4.iloc[i],col5.iloc[i]))
    edge_list.append((col2.iloc[i],col4.iloc[i],col5.iloc[i]))
if pd.isnull(col3.iloc[i]) != True and pd.isnull(col4.iloc[i]) != True:
    edge_list.append((col3.iloc[i],col4.iloc[i],col5.iloc[i]))

G=nx.Graph()
G.add_weighted_edges_from(edge_list)
G.remove_node(0.1)
nx.write_graphml(G, '/home/tessa/Git/Network_biosignatures/VULCAN 800k.graphml')

我想将生成的图形导出为graphml文件,以便我可以在Cytoscape中将其可视化,但由于某种原因,它会抛出错误消息:

Traceback (most recent call last):
 File "topo_measure_pn_hot jupiter_weighted.py", line 196, in <module>
nx.write_graphml(G, '/home/tessa/Git/Network_biosignatures/VULCAN 800k.graphml')
 File "<decorator-gen-202>", line 2, in write_graphml
 File "/usr/lib/python2.7/dist-packages/networkx/utils/decorators.py",      line 220, in _open_file
 result = func(*new_args, **kwargs)
 File "/usr/lib/python2.7/dist-packages/networkx/readwrite/graphml.py", line 82, in write_graphml
 writer.add_graph_element(G)
 File "/usr/lib/python2.7/dist-packages/networkx/readwrite/graphml.py", line 351, in add_graph_element
 self.add_edges(G,graph_element)
 File "/usr/lib/python2.7/dist-packages/networkx/readwrite/graphml.py", line 325, in add_edges
 self.add_attributes("edge", edge_element, data, default)
 File "/usr/lib/python2.7/dist-packages/networkx/readwrite/graphml.py", line 300, in add_attributes
 scope=scope, default=default_value)
 File "/usr/lib/python2.7/dist-packages/networkx/readwrite/graphml.py", line 288, in add_data
 '%s as data values.'%element_type)
 networkx.exception.NetworkXError: GraphML writer does not support <type 'numpy.float64'> as data values.

显然,graphml编写器不能很好地应对除图形的简单属性之外的任何东西,我猜它的重量是抛出错误。但是,我不确定 将权重值分配为numpy.float64,或者如何删除此赋值。

正在读入的数据格式如下:

1,H,H2O,OH,H2,1.75116588E-16,,,,,,,,,,,,,,,,,,,,,,,,,,,
2,H,H2O,OH,H2,4.00975292E-13,,,,,,,,,,,,,,,,,,,,,,,,,,,
3,O,H2,OH,H,9.25180896E-14,,,,,,,,,,,,,,,,,,,,,,,,,,,
4,O,H2,OH,H,1.04176774E-13,,,,,,,,,,,,,,,,,,,,,,,,,,,
5,O,H2O,OH,OH,1.04560994E-15,,,,,,,,,,,,,,,,,,,,,,,,,,,
6,O,H2O,OH,OH,2.6959031E-12,,,,,,,,,,,,,,,,,,,,,,,,,,,

感谢任何建议。谢谢!

1 个答案:

答案 0 :(得分:2)

这似乎是networkx中的一个错误。转换为基本的python类型解决了这个问题。

import numpy as np
import networkx as nx

total_nodes = 20
nodes = range(total_nodes)

p = 0.1
total_edges = int(p * total_nodes ** 2 )
sources = np.random.choice(nodes, total_edges)
targets = np.random.choice(nodes, total_edges)
weights = np.random.rand(total_edges)

# edge_list = zip(sources, targets, weights) # doesn't work
edge_list = [(int(s), int(t), float(w)) for s, t, w in zip(sources, targets, weights)] # works

G = nx.Graph()
G.add_weighted_edges_from(edge_list)
nx.write_graphml(G, 'test.graphml')

编辑:

显然,他们很清楚这个问题(https://github.com/networkx/networkx/issues/1556),但还没有解决它。