Question

我有一个文本文件，其中我需要每列，最好是字典或列表，格式为：

N       ID   REMAIN        VERS          
2 2343333   bana           twelve    
3 3549287   moredp       twelve        
3 9383737   hinsila           twelve           
3 8272655   hinsila           eight

我试过了：

crs = open("file.txt", "r")
for columns in ( raw.strip().split() for raw in crs ):  
    print columns[0]

结果='超出索引错误'

也尝试过：

crs = csv.reader(open(file.txt", "r"), delimiter=',', quotechar='|', skipinitialspace=True)
    for row in crs:
                   for columns in row:
                             print columns[3]

这似乎是将每个字符串作为列读取，而不是每个“字”

我想获得四列，即：

2
2343333
bana
twelve

进入单独的词典或列表

任何帮助都很棒，谢谢！

Answer 1

这对我来说很好用：

>>> crs = open("file.txt", "r")
>>> for columns in ( raw.strip().split() for raw in crs ):  
...     print columns[0]
... 
N
2
3
3
3

如果要将列转换为行，请使用zip。

>>> crs = open("file.txt", "r")
>>> rows = (row.strip().split() for row in crs)
>>> zip(*rows)
[('N', '2', '3', '3', '3'), 
 ('ID', '2343333', '3549287', '9383737', '8272655'), 
 ('REMAIN', 'bana', 'moredp', 'hinsila', 'hinsila'), 
 ('VERS', 'twelve', 'twelve', 'twelve', 'eight')]

如果您有空行，请在使用zip之前对其进行过滤。

>>> crs = open("file.txt", "r")
>>> rows = (row.strip().split() for row in crs)
>>> zip(*(row for row in rows if row))
[('N', '2', '3', '3', '3'), ('ID', '2343333', '3549287', '9383737', '8272655'), ('REMAIN', 'bana', 'moredp', 'hinsila', 'hinsila'), ('VERS', 'twelve', 'twelve', 'twelve', 'eight')]

Answer 2

>>> with open("file.txt") as f:
...    c = csv.reader(f, delimiter=' ', skipinitialspace=True)
...    for line in c:
...        print(line)
... 
['N', 'ID', 'REMAIN', 'VERS', ''] #that '' is for leading space after columns.
['2', '2343333', 'bana', 'twelve', '']
['3', '3549287', 'moredp', 'twelve', '']
['3', '9383737', 'hinsila', 'twelve', '']
['3', '8272655', 'hinsila', 'eight', '']

或者，老式的方式：

>>> with open("file.txt") as f:
...     [line.split() for line in f]
...
[['N', 'ID', 'REMAIN', 'VERS'],
 ['2', '2343333', 'bana', 'twelve'],
 ['3', '3549287', 'moredp', 'twelve'],
 ['3', '9383737', 'hinsila', 'twelve'],
 ['3', '8272655', 'hinsila', 'eight']]

获取列值：

>>> l
[['N', 'ID', 'REMAIN', 'VERS'],
 ['2', '2343333', 'bana', 'twelve'],
 ['3', '3549287', 'moredp', 'twelve'],
 ['3', '9383737', 'hinsila', 'twelve'],
 ['3', '8272655', 'hinsila', 'eight']]
>>> {l[0][i]: [line[i] for line in l[1:]]  for i in range(len(l[0]))}
{'ID': ['2343333', '3549287', '9383737', '8272655'],
 'N': ['2', '3', '3', '3'],
 'REMAIN': ['bana', 'moredp', 'hinsila', 'hinsila'],
 'VERS': ['twelve', 'twelve', 'twelve', 'eight']}

Answer 3

with  open("path\sample1.csv") as f:
    for line in f:
        print line

//逐行读取文件

Answer 4

你可以使用这样的列表理解：

with open("split.txt","r") as splitfile:
    for columns in [line.split() for line in splitfile]:
        print(columns)

然后你将把它放在一个二维数组中，允许你以任何你喜欢的方式对它进行分组。

Answer 5

只使用列表列表

import csv

columns = [[] for _ in range(4)]  # 4 columns expected

with open('path', rb) as f:
    reader = csv.reader(f, delimiter=' ')
    for row in reader:
        for i, col in enumerate(row):
            columns[i].append(col)

或者如果列数需要动态增长：

import csv

columns = []

with open('path', rb) as f:
    reader = csv.reader(f, delimiter=' ')
    for row in reader:
        while len(row) > len(columns):
            columns.append([])
        for i, col in enumerate(row):
            columns[i].append(col)

最后，您可以使用以下方式打印列：

for i, col in enumerate(columns, 1):
    print 'List{}: {{{}}}'.format(i, ','.join(col))

Answer 6

这个怎么样？

f = open("file.txt")

for i in f:
    k = i.split()
    for j in k:
        print j

python阅读文本文件

6 个答案: