Question

我应该在序言中说我是一个完整的Python新手。

我正在尝试创建一个脚本，它将遍历目录及其子目录中寻找文本文件。遇到文本文件时，它会解析文件并将其转换为NITF XML并上传到FTP目录。

此时我仍在努力将文本文件读入变量，以便可以将它们插入到正确位置的XML文档中。文本文件的示例如下。

Headline
Subhead
By A person
Paragraph text.

这是我到目前为止的代码：

with open("path/to/textFile.txt") as f:
    #content = f.readlines()
    head,sub,auth = [f.readline().strip() for i in range(3)]
    data=f.read()
pth = os.getcwd()

print head,sub,auth,data,pth

我的问题是：如何遍历文本文件（数据）的主体并将每行包装在HTML P标签中？例如;

<P>line of text in file </P> <P>Next line in text file</p>。

Answer 1

像

这样的东西

output_format = '<p>{}</p>\n'.format
with open('input') as fin, open('output', 'w') as fout:
    fout.writelines( output_format(line.strip()) for line in fin )

Answer 2

with open('infile') as fin, open('outfile',w) as fout:
    for line in fin:
        fout.write('<P>{0}</P>\n'.format(line[:-1])  #slice off the newline.  Same as `line.rstrip('\n')`.

#Only do this once you're sure the script works :)
shutil.move('outfile','infile')  #Need to replace the input file with the output file

Answer 3

这假定您要将新内容写回原始文件：

with open('path/to/textFile.txt') as f:
    content = f.readlines()

with open('path/to/textFile.txt', 'w') as f:
    for line in content:
        f.write('<p>' + line.strip() + '</p>\n')

Answer 4

在你的情况下，你应该替换

data=f.read()

使用：

data = '\n'.join("<p>%s</p>" % l.strip() for l in f)

Answer 5

在这里使用data=f.readlines()，

然后迭代数据并尝试这样的事情：

for line in data:
   line="<p>"+line.strip()+"</p>"
   #write line+'\n' to a file or do something else

Answer 6

附加

和＆lt; \ p＆gt;对于每一行

例如：

     data_new=[]
     data=f.readlines()
     for lines in data:
          data_new.append("<p>%s</p>\n" % data.strip().strip("\n"))

Answer 7

您可以使用fileinput模块就地修改一个或多个文件，并根据需要创建可选的备份文件（有关详细信息，请参阅其documentation）。这是用于处理一个文件。

import fileinput

for line in fileinput.input('testinput.txt', inplace=1):
    print '<P>'+line[:-1]+'<\P>'

'testinput.txt'参数也可以是两个或多个文件名的序列，而不仅仅是一个文件名，这可能很有用，尤其是当您使用os.walk()生成文件列表时要处理的目录及其子目录（正如您可能应该做的那样）。

Python-需要将字符附加到文本文件中每行的开头和结尾

7 个答案: