Question

我有3个输出文件，分别包含x，y，z坐标

file1.txt（仅包含x）| file2（Y）| file3（Z）

2.113
3.023
-7.234
...

和包含坐标数据的父pdb文件。我想只从pdb文件中提取那些与file1，file2，file3中的x，y，z坐标匹配的行.pdb文件行是;

ATOM 1 O5＆＃39; G A 93 -12.706 19.299 0.129 1.00 0.00 O

粗体值将是我复制整行的匹配条件。

1-如何合并三个输出文件以制作一个文件，该文件可以在一行中提供x，y，z坐标以使用我的脚本。

final_file = [file1，file2，file3]？

2-如何根据匹配标准提取点数;

def read_convex_points(final_file,parent_pdb):
    with open("output.txt","w") as f1:
        with open(final_file) as f2:
            with open(parent_pdb) as f3:
                for line1 in f2:
                    for line2 in f3: 
                        if line1 in line2:
                            f1.write(line2)
    return               

final_file = "file1.txt"
parent_pdb = "original.pdb"
read_convex_points(final_file,parent_pdb)

我写了类似的功能，但条件不起作用。

Answer 1

您可以合并这样的文件：

def merge(paths):
    with open(paths[0]) as f1, open(paths[1]) as f2, open(paths[2]) as f3:
        try:
            yield next(f1), next(f2), next(f3)
        except StopIteration:
            return

for x, y, z in merge((file1, file2, file3)):
    # check if matching

需要注意的是，假设文件长度相等，所以它会在遇到最短文件时停止。这可能是可以接受的。

Answer 2

这是在Python中粘贴多个文件的一种方法。它处理任意数量的输入文件，但是像Peter的解决方案一样，如果文件具有不同的行数，则当最短的文件用完行时它会停止。

输出文件中的字段由delimiter字符串分隔，默认情况下为空格。

def paste(sources, dest, delimiter=' '):
    fsrc = [open(fname, 'r') for fname in sources]
    fdest = open(dest, 'w')
    for t in zip(*fsrc):
        outline = delimiter.join([line.strip() for line in t]) + '\n'
        fdest.write(outline)
    for f in fsrc:
        f.close()
    fdest.close()


paste(('file1', 'file2', 'file3'), 'final_file')

从文本文件python中写入文本文件

2 个答案: