Question

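I am using the code below to edit a CSV file with Python. The functions it calls are defined in the upper part of the script.

Problem: I want the code below to start editing the CSV from the 2nd row and to exclude the 1st row, which contains the headers. Right now it applies the functions to the 1st row as well, so my header row is being changed.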

in_file = open("tmob_notcleaned.csv", "rb")
reader = csv.reader(in_file)
out_file = open("tmob_cleaned.csv", "wb")
writer = csv.writer(out_file)
row = 1
for row in reader:
    row[13] = handle_color(row[10])[1].replace(" - ","").strip()
    row[10] = handle_color(row[10])[0].replace("-","").replace("(","").replace(")","").strip()
    row[14] = handle_gb(row[10])[1].replace("-","").replace(" ","").replace("GB","").strip()
    row[10] = handle_gb(row[10])[0].strip()
    row[9] = handle_oem(row[10])[1].replace("Blackberry","RIM").replace("TMobile","T-Mobile").strip()
    row[15] = handle_addon(row[10])[1].strip()
    row[10] = handle_addon(row[10])[0].replace(" by","").replace("FREE","").strip()
    writer.writerow(row)
in_file.close()    
out_file.close()

I tried to solve this by initializing the row variable to 1, but it didn't work.

Please help me solve this issue.

Answer 1

Your reader variable is an iterable; by looping over it you retrieve the rows.

To make it skip one item before your loop, simply call next(reader, None) and ignore the return value.
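As a minimal sketch of why the None default is useful, the snippet below feeds csv.reader from in-memory io.StringIO buffers. The buffers and the sample rows are made up for illustration and stand in for real files. With a default, next() returns None on empty input instead of raising StopIteration.

```python
import csv
import io

# In-memory stand-ins for real files; the sample rows are made up.
reader = csv.reader(io.StringIO("name,size\nfoo,1\nbar,2\n"))
next(reader, None)          # consume the header row and discard it
data_rows = list(reader)    # only the data rows remain

# On empty input the default is returned and no StopIteration is raised.
empty_reader = csv.reader(io.StringIO(""))
skipped = next(empty_reader, None)
```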

You can also simplify your code a little; use the opened files as context managers to have them closed automatically:

import csv

# Python 3: open CSV files in text mode with newline="" (Python 2 used "rb"/"wb")
with open("tmob_notcleaned.csv", newline="") as infile, \
        open("tmob_cleaned.csv", "w", newline="") as outfile:
    reader = csv.reader(infile)
    next(reader, None)  # skip the headers
    writer = csv.writer(outfile)
    for row in reader:
        # process each row
        writer.writerow(row)

# no need to close, the files are closed automatically when you get to this point.

If you want to write the header to the output file unprocessed, that's easy too: pass the output of next() to writer.writerow():

headers = next(reader, None)  # returns the headers or `None` if the input is empty
if headers:
    writer.writerow(headers)
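Putting the pieces together, a sketch of the whole flow might look like the following. The helper name clean_rows, the process callback, and the sample data are all hypothetical; upper-casing each cell merely stands in for the real cleaning functions from the question.

```python
import csv
import io

def clean_rows(infile, outfile, process):
    """Copy CSV rows from infile to outfile, leaving the header row untouched."""
    reader = csv.reader(infile)
    writer = csv.writer(outfile, lineterminator="\n")
    headers = next(reader, None)   # None if the input is empty
    if headers:
        writer.writerow(headers)   # the header is written as-is
    for row in reader:
        writer.writerow(process(row))  # only data rows are processed

# Made-up sample data; upper-casing stands in for the real cleaning step.
source = io.StringIO("model,color\niphone,black\n")
target = io.StringIO()
clean_rows(source, target, lambda row: [cell.upper() for cell in row])
result = target.getvalue()
```

Taking file-like objects rather than file names keeps the helper easy to try out in memory; with real files you would pass the objects returned by open().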

Answer 2

Another way of solving this is to use the DictReader class, which "skips" the header row and uses it to allow named indexing.

Given "foo.csv" as follows:

FirstColumn,SecondColumn
asdf,1234
qwer,5678

Use DictReader like this:

import csv
with open('foo.csv') as f:
    reader = csv.DictReader(f, delimiter=',')
    for row in reader:
        print(row['FirstColumn'])  # Access by column header instead of column number
        print(row['SecondColumn'])
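Since the question is about writing an edited file, DictReader can be paired with csv.DictWriter so the header is carried over unchanged. This is only a sketch: the in-memory buffers replace real files, and the upper-casing of FirstColumn is an arbitrary example edit.

```python
import csv
import io

# In-memory stand-ins for foo.csv and the output file.
source = io.StringIO("FirstColumn,SecondColumn\nasdf,1234\nqwer,5678\n")
target = io.StringIO()

reader = csv.DictReader(source)
# reader.fieldnames holds the header row that DictReader consumed.
writer = csv.DictWriter(target, fieldnames=reader.fieldnames, lineterminator="\n")
writer.writeheader()               # write the header unchanged
for row in reader:
    row["FirstColumn"] = row["FirstColumn"].upper()  # example edit by column name
    writer.writerow(row)
output = target.getvalue()
```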

Answer 3

Doing row=1 won't change anything, because the loop just overwrites it with each row it reads.

You want to call next(reader) to skip one row.
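A small illustration of the difference, using made-up in-memory data: the for statement rebinds row on every iteration, so the earlier assignment is discarded, whereas next(reader) advances the reader itself. Note that next(reader) without a default raises StopIteration if the input is empty.

```python
import csv
import io

reader = csv.reader(io.StringIO("header_a,header_b\n1,2\n"))
row = 1                  # has no effect on where iteration starts
seen = []
for row in reader:       # the loop rebinds `row` on every iteration
    seen.append(row)
# `seen` still begins with the header row.

reader = csv.reader(io.StringIO("header_a,header_b\n1,2\n"))
next(reader)             # advance the reader itself past the header
remaining = list(reader)
```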

Answer 4

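Inspired by Martijn Pieters' response.

If you only need to delete the header from the CSV file, you can work more efficiently by using the standard Python file I/O library and avoiding the csv library altogether: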

with open("tmob_notcleaned.csv", "rb") as infile, open("tmob_cleaned.csv", "wb") as outfile:
   next(infile)  # skip the headers
   outfile.write(infile.read())
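For large files, a possible variation is to stream the remainder with shutil.copyfileobj instead of reading it all into memory at once. The helper name strip_first_line and the sample bytes are hypothetical. Like the snippet above, this assumes the header occupies exactly one physical line, which would not hold if a header field contained a quoted newline.

```python
import io
import shutil

def strip_first_line(infile, outfile):
    """Copy infile to outfile without its first line, streaming in chunks."""
    next(infile, None)                   # skip the header line; tolerate empty input
    shutil.copyfileobj(infile, outfile)  # copy everything that is left

# In-memory stand-ins for files opened in binary mode.
source = io.BytesIO(b"h1,h2\r\n1,2\r\n3,4\r\n")
target = io.BytesIO()
strip_first_line(source, target)
copied = target.getvalue()
```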

Skip the headers when editing a csv file using Python
