Why does .readlines() list individual characters?

Asked: 2013-05-17 17:30:00

Tags: python list io split

I have a text file in this format:

EFF   3500.  GRAVITY 0.00000  SDSC GRID  [+0.0]   VTURB 2.0 KM/S    L/H 1.25                            
  wl(nm)    Inu(ergs/cm**2/s/hz/ster) for 17 mu in 1221 frequency intervals
            1.000   .900  .800  .700  .600  .500  .400  .300  .250  .200  .150  .125  .100  .075  .050  .025  .010
    9.09 0.000E+00     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
    9.35 0.000E+00     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
    9.61 0.000E+00     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
    9.77 0.000E+00     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
    9.96 0.000E+00     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
   10.20 0.000E+00     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
   10.38 0.000E+00     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

...more numbers 

I am trying to make it so that File[0][0] prints the word "EFF", and so on.

import sys
import numpy as np
from math import *
import matplotlib.pyplot as plt

print 'Number of arguments:', len(sys.argv), 'arguments.'
print 'Argument List:', str(sys.argv)

z = np.array(sys.argv)          #store all of the file names into array

i = len(sys.argv)           #the length of the filenames array

File = open(str(z[1])).readlines()  #load spectrum file 

for n in range(0, len(File)):
    File[n].split()

for n in range(0, len(File[1])):
    print File[1][n]

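However, it keeps printing single characters, as if each list index held a single character. This includes the whitespace. I put split() in a loop because writing readlines().split() gives an error.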

Output:

    E
    F
    F



    3
    5
    0
    0
    .


    G
    R
    A
    V
    I

...etc.

What am I doing wrong?

2 answers:

Answer 0 (score: 3)

>>> text = """some
... multiline
... text
... """
>>> lines = text.splitlines()
>>> for i in range(len(lines)):
...     lines[i].split()  # split *returns* the list of tokens
...                       # it does *not* modify the string inplace
... 
['some']
['multiline']
['text']
>>> lines   #strings unchanged
['some', 'multiline', 'text']
>>> for i in range(len(lines)):
...     lines[i] = lines[i].split() # you have to modify the list
... 
>>> lines
[['some'], ['multiline'], ['text']]

If you want a one-liner, do:

>>> words = [line.split() for line in text.splitlines()]
>>> words
[['some'], ['multiline'], ['text']]

With a file object it would be:

with open(z[1]) as f:  
    File = [line.split() for line in f]
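
As a self-contained sketch of the same idea: the snippet below uses Python 3 syntax (print as a function, unlike the Python 2 code in the question) and substitutes io.StringIO, holding two lines abbreviated from the question, for the real spectrum file. The names sample and rows are illustrative assumptions, not part of the original code.

```python
import io

# Stand-in for the real spectrum file; both lines are taken from the question.
sample = io.StringIO(
    "EFF   3500.  GRAVITY 0.00000  SDSC GRID  [+0.0]   VTURB 2.0 KM/S    L/H 1.25\n"
    "  wl(nm)    Inu(ergs/cm**2/s/hz/ster) for 17 mu in 1221 frequency intervals\n"
)

# Iterating over a file-like object yields one line at a time;
# split() with no arguments splits on runs of whitespace.
rows = [line.split() for line in sample]

print(rows[0][0])  # first token of the first line
print(rows[0][1])  # second token of the first line
print(rows[1][0])  # first token of the second line
```

Here rows is a list of lists, so rows[0][0] is the whole token "EFF" rather than the single character "E".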

By the way, you are using an anti-idiom when looping. If you want to loop over an iterable, just do:

for element in iterable:
    #...

If you also need the index of each element, use enumerate:

for index, element in enumerate(iterable):
    #...

In your case:

for i, line in enumerate(File):
    File[i] = line.split()

for word in File[1]:
    print word

Answer 1 (score: 1)

You want something like this:

for line in File:
    fields = line.split()
    #fields[0] is "EFF", fields[1] is "3500.", etc.

The split() method returns a list of strings; it does not modify the object it is called on.
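
A minimal sketch of this behaviour, in Python 3 syntax and with illustrative variable names (line, fields, lines and error_message are assumptions made for the example). It also shows why indexing an unsplit line gives single characters, and why calling split() directly on the list returned by readlines() fails:

```python
line = "EFF   3500.  GRAVITY 0.00000"

fields = line.split()   # split() returns a new list of tokens
print(fields)
print(line)             # the original string is unchanged

# readlines() returns a list of strings, so indexing twice
# picks one character out of one line.
lines = [line, "    9.09 0.000E+00     0"]
print(lines[0][0])      # a single character of the first line

# A list has no split() method, hence the error from readlines().split().
try:
    lines.split()
except AttributeError as exc:
    error_message = str(exc)
    print(error_message)

# Storing the result of split() back into the list gives whole tokens.
lines = [entry.split() for entry in lines]
print(lines[0][0])      # the first token of the first line
```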