Question

如何逐行阅读并解析python中的文件？

我是python的新手。

第一行输入是模拟次数。下一行是行数（x），后跟一个空格，后跟列数（y）。下一组y行将包含x个字符，其中一个句点（＆＃39;。＆＃39;）表示空格，单个capitol“A”表示起始代理。

我的代码出错了

Traceback (most recent call last):
    numSims = int (line)
TypeError: int() argument must be a string or a number, not 'list'

感谢您的帮助。

INPUT.TXT

2   --- 2 simulations
3 3  -- 3*3 map
.A.  --map
AA.
A.A
2 2  --2*2 map
AA  --map
.A

def main(cls, args):
    numSims = 0
    path = os.path.expanduser('~/Desktop/input.txt') 
    f = open(path) 
    line = f.readlines() 
    numSims = int (line)
    print numSims
    k=0
    while k < numSims:
        minPerCycle = 1
        row = 0
        col = 0
        xyLine= f.readLines()
        row = int(xyLine.split()[0]) 
        col = int(xyLine.split()[1])
        myMap = [[Spot() for j in range(col)] for i in range(row)] 
        ## for-while
        i = 0
        while i < row:
            myLine = cls.br.readLines()
            ## for-while
            j = 0
            while j < col:
                if (myLine.charAt(j) == 'B'):
                    cls.myMap[i][j] = Spot(True)
                else:
                    cls.myMap[i][j] = Spot(False)
                j += 1
            i += 1

对于Spot.py

Spot.py

class Spot(object):
isBunny = bool()
nextCycle = 0
UP = 0
RIGHT = 1
DOWN = 2
LEFT = 3
SLEEP = 4

def __init__(self, newIsBunny):
    self.isBunny = newIsBunny
    self.nextCycle = self.UP

Answer 1

你的错误很多，以下是我迄今为止发现的错误：

行numSims = (int)line没有按照您的想法行事。 Python没有C强制转换，您需要调用 int类型：
```
numSims = int(line)
```
稍后使用Int的大写拼写：
来复合此错误
```
row = (Int)(xyLine.split(" ")[0])
col = (Int)(xyLine.split(" ")[1])
```
以类似的方式纠正这些：
```
row = int(xyLine.split()[0])
col = int(xyLine.split()[1])
```
由于.split()的默认值是在空格上拆分，因此可以省略" "参数。更好的是，将它们组合成一行：
```
row, col = map(int, xyLine.split())
```
您永远不会增加k，因此您的while k < numSims:循环会一直持续，因此您会收到EOF错误。改为使用for循环：
```
for k in xrange(numSims):
```
您不需要在此功能的任何位置使用while，它们都可以用for variable in xrange(upperlimit):循环替换。
Python字符串没有.charAt方法。请改用[index]：
```
if myLine[j] == 'A':
```
但由于myLine[j] == 'A'是一个布尔测试，你可以简化你的Spot()实例化，如下所示：
```
for i in xrange(row):
    myLine = f.readLine()
    for j in xrange(col):
        cls.myMap[i][j] = Spot(myLine[j] == 'A')
```
在Python中没有必要非常初始化变量。如果您要在下一行分配新值，则可以获得大多数numSims = 0和col = 0行等。
您改为创建'myMap variable but then ignore it by referring to cls.myMap`。
这里没有处理多重地图;文件中的最后一个地图将覆盖任何前面的地图。

重写版本：

def main(cls, args):
    with open(os.path.expanduser('~/Desktop/input.txt')) as f:
        numSims = int(f.readline())
        mapgrid = []
        for k in xrange(numSims):
            row, col = map(int, f.readline().split())  
            for i in xrange(row):
                myLine = f.readLine()
                mapgrid.append([])
                for j in xrange(col):
                    mapgrid[i].append(Spot(myLine[j] == 'A'))
         # store this map somewhere?
         cls.myMaps.append(mapgrid)

Answer 2

虽然Martijn Pieters does a very good job解释了如何使代码更好，但我建议采用完全不同的方法，即使用Monadic Parser Combinator库，例如Parcon。这允许您超越上下文无关语法，并根据当前解析过程提取的信息在运行时轻松修改解析器：

from functools import partial
from parcon import (Bind, Repeat, CharIn, number, End,
                    Whitespace, ZeroOrMore, CharNotIn)

def array(parser, size):
    return Repeat(parser, min=size, max=size)

def matrix(parser, sizes):
    size, sizes = sizes[0], sizes[1:]
    return array(matrix(parser, sizes) if sizes else parser, size)

comment = '-' + ZeroOrMore(CharNotIn('\n')) + '\n'

sims = Bind(number[int],
            partial(array,
                    Bind(number[int] + number[int],
                         partial(matrix,
                                 CharIn('.A')['A'.__eq__])))) + End()

text = '''
2   --- 2 simulations
3 3  -- 3*3 map
.A.  --map
AA.
A.A
2 2  --2*2 map
AA  --map
.A
'''

import pprint
pprint.pprint(sims.parse_string(text, whitespace=Whitespace() | comment))

结果：

$ python numsims.py
[[False, True, False], [True, True, False], [True, False, True]]
[[True, True], [False, True]]

起初它有点像心灵缠绕，就像monadic一样。但表达的灵活性和简洁性非常值得花时间学习单子。

如何逐行阅读并解析python中的文件？

2 个答案: