Question

亲爱的所有人）我有一个问题对我来说很难，因为我刚开始做Python。所以，让我们假设我们有一个2列的文件，如：

col_1 col_2
1     6 
2     7
3     8
4     9
5     10

我需要执行几次转换（使用不同的等式），这将创建额外的列（基本上，大约15列）。但我不确定如何将其包装为可读和逻辑代码。

让我告诉你我的想法（我确信他们错了，但我真的希望你的眼睛不会开始流血:)）。首先，这个问题是否适合使用类？或只是功能？

INPUT = 'input.txt'
OUTPUT = 'output.txt'

def col_3_function():
    with open(INPUT, 'r') as reader, open(OUTPUT, 'a') as writer:
        for line in reader:
            global col_3
            column = line.strip().split()
            col_1 = float(column[1])
            col_2 = float(column[2])
            col_3 = (col_1 + col_2)

def col_4_function():
    with open(INPUT, 'r') as reader, open(OUTPUT, 'a') as writer:
        for line in reader:
            global col_4
            column = line.strip().split()
            col_1 = float(column[1])
            col_2 = float(column[2])
            col_4 = col_3 - col_2
            print(col_1, col_2, col_3, col_4, sep='\t', file=writer)

if __name__ == '__main__':
    col_4_function()

依此类推，直到有所需数量的列。

我有几个绊脚石：

即使在这个简单的版本中它也不起作用：）
据我所知，使用很多全局变量是一种移动。
每个下一列都应该使用数据，也不仅仅使用第1列和第2列，而是使用之前在column_3，column_4中创建的数据，等等。
我也不喜欢每个功能的重复性。不是Python方法。
我应该上课吗？还是只是功能？应该是什么样的？

对我来说还有很多其他困难，但我应该从最普遍的开始。

我知道这是一个非常普遍而且很重要的问题，但这对我来说非常重要。我真的很感谢你的想法和想法。真。

Answer 1

这是一个基本的approche。它不处理文件中的第一行（我不知道你想如何命名列，所以我没有这样做:)）

INPUT = "toto.txt"
OUTPUT = "titi.txt"

def col3_fn(columns):
    """ Just sum column 1 and 2. Used as generator of column 3 content """
    return int(columns[0]) + int(columns[1])

def col4_fn(columns):
    """ Difference between column 2 and 3. Used as generator of column 4 content """
    return int(columns[1]) - int(columns[2])

# List of functions used for column generation.
# You can add as much as you want.
functions = [col3_fn, col4_fn]

with open(INPUT, "r") as inp, open(OUTPUT, "w") as out:
    for line in inp.readlines():
        splited = line[:-1].split()

        for f in functions:
            splited.append(str(f(splited)))

        out.write("\t".join(splited) + "\n")

输入文件（ toto.txt ）：

1 1
2 2
3 3

输出文件（ titi.txt ）：

1   1   2   -1
2   2   4   -2
3   3   6   -3

Answer 2

你应该使用numpy

import numpy as np
col1, col2 = np.loadtxt (INPUT, dtype=int, unpack=True)
col3 = col1 + col2
col4 = col3 - col2
np.savetxt (OUTPUT, np.vstack((col1, col2, col3, col4)).T, fmt='%d')

如果您使用浮动广告，则不需要dtype和fmt参数

Python - 将2列文件转换为10列

2 个答案: