Question

我有530个“ csv”文件，我需要将所有“ csv”文件合并到一个文件中，并且还需要创建新列。我的创建新列的示例如下：

link,billname,congressman,demorrep,noofdemcos,noofrepcos,noofothercos,progress
https://www.congress.gov/bill/110th-congress/house-bill/238?s=1&r=50501, H.R.238 — 110th Congress (2007-2008),D,"Rep. Waxman, Henry A. [D-CA-30]",1,0,0,Passed
.....

在这里，我首先尝试将所有从第0页到第529页开始的'csv'文件合并到一个文件中，我需要检查帐单名称“众议员Waxman，HenryA。[D-CA-30] ”，并为CA创建新列，为每个变量创建30列

import pandas as pd

for i in range(530):
    csv_paths = ["page"+str(i)+".csv"]

    all_data_frames = [pd.read_csv(csv_path) for csv_path in csv_paths]
    merged_data_frame = pd.concat(all_data_frames)
    merged_data_frame.to_csv("mergedfirst.csv",index=False)

在这里，当我运行此代码时，我只是接收mergedfirst文件，而当我输入“ csv”文件时，与page529相同。

link,billname,congressman,demorrep,noofdemcos,noofrepcos,noofothercos,progress
https://www.congress.gov/bill/110th-congress/house-bill/238?s=1&r=50501, H.R.238 — 110th Congress (2007-2008),D,"Rep. Waxman, Henry A. [D-CA-30]",1,0,0,Passed
.....

我的预期结果是我希望所有page0至page529个csv文件都在一个csv文件中，然后创建新列（检查demorrep [D-CA-30]的内部为CA和30创建新列。

通过检查第一个字符串列，Python数据合并并创建新列

0 个答案: