Python:创建从注释中剥离的文件副本

时间:2017-02-09 10:52:30

标签: python file-io

仍然是Python新手,试图按照书中的例子。这应该创建一个文本文件副本,从#注释开始的所有行中删除。它就像(包括我的实习生评论):

# this should be able to (create a file if not present yet and) take a file and then write another one with the same contents stripped off comments
# I wasnt able to bring it into a proper work - resulting file empty

f = open("test.dat","w")
# write several lines (the new-line n-symbol)
f.write("line one\nline two\nline three\n# blah blah \n# blah")
#
f.close()
# readline method reads all the characters up to and including the next newline character:
f = open("test.dat","r")
print ( f.read() )
print()
# readlines returns lines including newline character


newf = open("test2.dat","w")
newf.close()
newf = open("test2.dat","r")

while True:
  text = f.readline()
  if text == "":
    break
  if text == "#":
    continue
  newf.write(text)
f.close()
newf.close()

print()

newf = open("test2.dat","r")
print (newf.read())
newf.close()

但是生成的文件是空的并且有0b。我可以谦虚地问出什么问题吗?谢谢!

1 个答案:

答案 0 :(得分:1)

您的代码有几个问题:

  • 您打开输入文件进行阅读并在print(f.read())内消耗了所有内容;文件指针现在位于文件的末尾。

  • 打开输出文件进行写入 - 但随后立即关闭,这会创建一个空文件。然后打开此空文件以进行阅读

  • 你的循环一开始就退出,因为文件末尾的readline()将返回一个空字符串''

  • 您的if不会检查每一行的第一个字符 - 而是将整行与#相匹配。由于该行还包含换行符,因此某一行上的#都不符合此条件(readline将返回'#\n'

您案例的惯用代码可能是

with open('test.dat', 'w') as output_file:
    # write several lines (the new-line n-symbol)
    output_file.write("line one\nline two\nline three\n# blah blah \n# blah")
# file closed automatically

with open('test.dat') as input_file:
    print(input_file.read())
    print()
# closed automatically

# reopen input file, open output file
with open('test.dat') as input_file, open('test2.dat', 'w') as output_file:
    for line in input_file:
        if not line.startswith('#'):
            output_file.write(line) 
# both files again closed automatically at the end of with block

print('Contents of test2.dat are now:')
with open('test2.dat') as input_file:
    print(input_file.read())