Question

I have a list of CSV files that I want to concatenate using pandas.

Below is a sample view of the CSV files:

Note: column 4 stores the latitude and column 5 stores the longitude.

store-001,store_name,building_no_060,23.4324,43.3532,2018-10-01 10:00:00,city_1,state_1
store-002,store_name,building_no_532,12.4345,45.6743,2018-10-01 12:00:00,city_2,state_1
store-003,store_name,building_no_536,54.3453,23.3444,2018-07-01 04:00:00,city_3,state_1
store-004,store_name,building_no_004,22.4643,56.3322,2018-04-01 07:00:00,city_2,state_3
store-005,store_name,building_no_453,76.3434,55.4345,2018-10-02 16:00:00,city_4,state_2
store-006,store_name,building_no_456,35.3455,54.3334,2018-10-05 10:00:00,city_6,state_2

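When I try to concatenate multiple CSV files in the format above, I find that the columns holding latitude and longitude are saved first, in the first row at A2-A30, followed by all the other columns (in row 1).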

Here is how I perform the concatenation:

import glob
import os
import pandas as pd

masterlist = glob.glob('path')  # 'path' is where all the CSV files are stored

df_v1 = [pd.read_csv(fp, sep=',', error_bad_lines=False).assign(FileName=os.path.basename(fp)) for fp in masterlist]  # also adds each file's name as a FileName column
df = pd.concat(df_v1, ignore_index=True)
df.to_csv('path', index=False)  # stores the final concatenated CSV file

Can anyone explain why the concatenation is not working properly? Thanks.
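One possible cause, assuming the files really have no header row as in the sample: `pd.read_csv` treats the first line of each file as column names by default, so every file gets different column labels (for example `23.4324` and `43.3532` in one file, `12.4345` and `45.6743` in the next). `pd.concat` aligns frames by column label, so the result is a union of mismatched columns rather than eight stacked columns. The sketch below reads each file with `header=None` and an explicit list of names. The column names in `COLUMNS`, the helper `concat_headerless_csvs`, and the temporary demo files are hypothetical and only serve the illustration.

```python
import glob
import os
import tempfile

import pandas as pd

# Hypothetical column names; the sample files carry no header row.
COLUMNS = ["store_id", "store_name", "building_no", "latitude",
           "longitude", "timestamp", "city", "state"]


def concat_headerless_csvs(pattern):
    """Read every CSV matching `pattern` without a header row and stack them."""
    frames = [
        pd.read_csv(fp, header=None, names=COLUMNS)
        .assign(FileName=os.path.basename(fp))
        for fp in sorted(glob.glob(pattern))
    ]
    return pd.concat(frames, ignore_index=True)


# Demonstration with two temporary files built from the sample rows.
with tempfile.TemporaryDirectory() as tmp:
    row_a = ("store-001,store_name,building_no_060,23.4324,43.3532,"
             "2018-10-01 10:00:00,city_1,state_1\n")
    row_b = ("store-002,store_name,building_no_532,12.4345,45.6743,"
             "2018-10-01 12:00:00,city_2,state_1\n")
    for name, text in [("a.csv", row_a), ("b.csv", row_b)]:
        with open(os.path.join(tmp, name), "w") as fh:
            fh.write(text)
    combined = concat_headerless_csvs(os.path.join(tmp, "*.csv"))

print(combined.shape)
print(list(combined.columns))
```

If the real files do contain a header row, this explanation does not apply and `header=None` should be left out.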

pandas - Problem when concatenating multiple CSV files into one file

0 answers: