Python - Convert an edge list to an adjacency matrix

Time: 2013-10-03 05:51:41

Tags: python

My data is in the following format:

user,item,rating
1,1,3
1,2,2
2,1,2
2,4,1

and so on. I want to convert it to matrix form,

so that the output looks like this:

Item--> 1,2,3,4....
user
1       3,2,0,0....
2       2,0,0,1

.... and so on.

How can I do this in Python?

Thanks

2 answers:

Answer 0: (score: 2)

import numpy as np

data = [
    (1,1,3),
    (1,2,2),
    (2,1,2),
    (2,4,1),
]

#import csv
#with open('data.csv') as f:
#    next(f) # Skip header
#    data = [tuple(map(int, row)) for row in csv.reader(f)]

n = max(max(user, item) for user, item, rating in data) # Get size of matrix
matrix = np.zeros((n, n), dtype=int) # Integer matrix filled with zeros
for user, item, rating in data:
    matrix[user-1, item-1] = rating # Convert to 0-based index.

for row in matrix:
    print(row)

Prints

[3 2 0 0]
[2 0 0 1]
[0 0 0 0]
[0 0 0 0]
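
The matrix above is square (n x n), while the question asks for one row per user and one column per item. A minimal sketch of a rectangular variant with plain lists follows. The function name `to_matrix` is hypothetical, and user and item IDs are assumed to be 1-based positive integers:

```python
def to_matrix(data):
    """Build a users x items rating matrix from (user, item, rating) triples.

    Assumes user and item IDs are 1-based positive integers.
    """
    n_users = max(user for user, _, _ in data)
    n_items = max(item for _, item, _ in data)
    # One row per user, one column per item; missing ratings stay 0.
    matrix = [[0] * n_items for _ in range(n_users)]
    for user, item, rating in data:
        matrix[user - 1][item - 1] = rating  # convert to 0-based index
    return matrix


data = [(1, 1, 3), (1, 2, 2), (2, 1, 2), (2, 4, 1)]
for row in to_matrix(data):
    print(row)
# [3, 2, 0, 0]
# [2, 0, 0, 1]
```

For large, mostly empty data a sparse representation would likely be a better fit than a dense list of lists.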

Answer 1: (score: 1)

A different approach from @falsetru's.

Are you reading the data from a file?

If so, you can use a dictionary:

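from collections import defaultdict

# infile and outfile are assumed to be already-open file objects.
valdict = defaultdict(int)  # missing (user, item) pairs default to 0
nuser = 0
nitem = 0
next(infile)  # skip the "user,item,rating" header line (assumed present)
for line in infile:
    # Convert the fields to int so that max() and the lookups below work
    user, item, rating = map(int, line.strip().split(","))
    valdict[user, item] = rating
    nuser = max(nuser, user)
    nitem = max(nitem, item)

# Header row of item numbers; the leading empty cell sits above the user column
towrite = "," + ",".join(str(j) for j in range(1, nitem + 1)) + "\n"
for i in range(1, nuser + 1):
    towrite += str(i)
    for j in range(1, nitem + 1):
        towrite += "," + str(valdict[i, j])
    towrite += "\n"

outfile.write(towrite)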
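
To try the dictionary approach end to end without real files, here is a sketch that reads the question's sample data from an in-memory `io.StringIO` and uses the `csv` module for parsing and writing. The helper name `edges_to_csv` and the `"user"` label in the header cell are assumptions for illustration, not part of the answer above:

```python
import csv
import io
from collections import defaultdict


def edges_to_csv(infile, outfile):
    """Read user,item,rating rows from infile and write a user x item grid."""
    ratings = defaultdict(int)  # unrated (user, item) pairs default to 0
    n_users = n_items = 0
    reader = csv.reader(infile)
    next(reader)  # skip the "user,item,rating" header (assumed present)
    for row in reader:
        if not row:  # tolerate blank lines
            continue
        user, item, rating = map(int, row)
        ratings[user, item] = rating
        n_users = max(n_users, user)
        n_items = max(n_items, item)

    writer = csv.writer(outfile, lineterminator="\n")
    # Header row: a label for the user column, then the item numbers.
    writer.writerow(["user"] + list(range(1, n_items + 1)))
    for user in range(1, n_users + 1):
        writer.writerow(
            [user] + [ratings[user, item] for item in range(1, n_items + 1)]
        )


sample = io.StringIO("user,item,rating\n1,1,3\n1,2,2\n2,1,2\n2,4,1\n")
result = io.StringIO()
edges_to_csv(sample, result)
print(result.getvalue())
# user,1,2,3,4
# 1,3,2,0,0
# 2,2,0,0,1
```

With real files, the two `StringIO` objects would be replaced by handles from `open(...)`, using `newline=""` as the `csv` documentation recommends.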