How to convert multiple .xlsx files to .csv using pandas and glob

Time: 2018-02-22 17:55:18

Tags: python pandas glob

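I have a folder, JanuaryDataSentToResourcePro, that contains multiple .xlsx files. I want to iterate over the folder and convert every file to .csv, keeping the same file names.

To do this I am trying to use glob, but I get the error: TypeError: 'module' object is not callable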

import glob
excel_files = glob('*xlsx*')

for excel in excel_files:
    out = excel.split('.')[0]+'.csv'
    df = pd.read_excel(r'''C:\Users\username\Documents\TestFolder\JanuaryDataSentToResourcePro\ResourceProDailyDataset_01_01_2018.xlsx''', 'ResourceProDailyDataset')
    df.to_csv(out) 

I'm new to Python. Does this look right?

Update:

import pandas as pd
import glob
excel_files = glob.glob("*.xlsx")

for excel in excel_files:
    out = excel.split('.')[0]+'.csv'
    df = pd.read_excel(excel, 'ResourceProDailyDataset')
    df.to_csv(out)

But the .xlsx files are still not converted to .csv.

1 answer:

Answer 0 (score: 3)

The glob module should be used like this:
import glob
f = glob.glob("*.xlsx")

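glob is the module, not a function, so it cannot be called directly; glob.glob is the function that returns the matching file names.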
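The difference can be checked directly in an interpreter. The following is a small illustrative sketch, not part of the original answer; the alias find_files is a name chosen here for the example.

```python
import glob
import types

# `import glob` binds the name `glob` to the module, not to a function.
print(isinstance(glob, types.ModuleType))  # the module object
print(callable(glob))                      # modules cannot be called
print(callable(glob.glob))                 # the function inside the module can

# Calling the module reproduces the error from the question.
try:
    glob('*.xlsx')
except TypeError as exc:
    error_message = str(exc)
    print(error_message)

# Option 1: call the function through the module.
matches = glob.glob('*.xlsx')

# Option 2: import the function itself under a separate name
# (find_files is a hypothetical alias used only in this sketch).
from glob import glob as find_files
same_matches = find_files('*.xlsx')
```

Both options search the current working directory and return a list of matching file names, which is empty when nothing matches.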

========================================

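import glob
import os
import pandas as pd  # needed for pd.read_excel below

# Assumed folder path; adjust it to where the files actually are.
excel_files = glob.glob('C:/Users/username/Documents/TestFolder/JanuaryDataSentToResourcePro/*.xlsx')
for excel in excel_files:
    # splitext drops only the final extension, even if the path contains other dots
    out = os.path.splitext(excel)[0] + '.csv'
    df = pd.read_excel(excel)  # reads only the first sheet
    df.to_csv(out)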
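If the loop runs without errors but no .csv files appear where expected, two things are worth checking. A relative pattern such as "*.xlsx" is resolved against the current working directory, which may not be the data folder, and excel.split('.')[0] cuts at the first dot anywhere in the path. The following is a minimal sketch using only the standard library; the helper name csv_targets and the example path are hypothetical and not part of the original answer.

```python
import glob
import os


def csv_targets(folder):
    """Return (xlsx_path, csv_path) pairs for every .xlsx file directly inside folder."""
    pattern = os.path.join(folder, "*.xlsx")
    pairs = []
    for xlsx_path in sorted(glob.glob(pattern)):
        # splitext removes only the final extension, so other dots in the path are kept.
        root, _ext = os.path.splitext(xlsx_path)
        pairs.append((xlsx_path, root + ".csv"))
    return pairs


# Why split('.')[0] is fragile (hypothetical path with a dot in a folder name):
path = "C:/Users/first.last/Reports/Dataset_01.xlsx"
naive = path.split(".")[0] + ".csv"        # "C:/Users/first.csv"
safe = os.path.splitext(path)[0] + ".csv"  # "C:/Users/first.last/Reports/Dataset_01.csv"

# A bare "*.xlsx" pattern is searched in this directory:
print("Working directory:", os.getcwd())
```

Each pair returned by csv_targets can then be used in the loop above: read xlsx_path with pd.read_excel and write the result to csv_path with df.to_csv.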