Convert an xls file with multiple worksheets into separate CSV files in Python

Time: 2016-09-02 00:01:47

Tags: python json csv

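I have a JSON file, attached below, that I need to read in Python. It contains the path to an xls file with multiple worksheets. The workbook needs to be cleaned, and each worksheet written out as a separate CSV file. Any ideas on how to do this?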

{
    "file": {
        "path": "C:/.../xyz.xlsx",
        "sheetname": "Sheet1",
        "Clean": {
            "1": "A",
            "2": "B",
            "3": "C"
        },
        "Delete": {
            "1": "D",
            "2": "E"
        },
        "outfile": "C:/.../out_xyz.csv"
    }
}

I have gone through the links listed below, but still got nowhere:

Reading JSON from a file?
How can i split an Excel (.xls) file that contains multiple sheets into separate excel files?
Save each sheet in a workbook to separate CSV files

1 answer:

Answer 0: (score: 0)

How about this?

Use Python with xlrd and xlwt. See http://www.python-excel.org

The following script should do what you want:

import sys

import xlrd
import xlwt


def raj_split(in_path, out_stem):
    """Write one .xls file per column to the right of the last 1 in row 0."""
    in_book = xlrd.open_workbook(in_path)
    in_sheet = in_book.sheet_by_index(0)
    first_row = in_sheet.row_values(0)
    # Find the rightmost 1 value in the first row; the columns up to and
    # including it are common to every output file.
    # Note: max() raises ValueError if the first row contains no 1.
    split_pos = max(
        colx for colx, value in enumerate(first_row) if value == 1.0
    ) + 1
    out_book = xlwt.Workbook()
    out_sheet = out_book.add_sheet("Sheet1", cell_overwrite_ok=True)
    # Copy the common cells.
    for rowx in range(in_sheet.nrows):
        row_vals = in_sheet.row_values(rowx, end_colx=split_pos)
        for colx in range(split_pos):
            out_sheet.write(rowx, colx, row_vals[colx])
    # For each remaining column ...
    for out_num, out_col in enumerate(range(split_pos, in_sheet.ncols), start=1):
        # ... overwrite the `split_pos` column ...
        for rowx, value in enumerate(in_sheet.col_values(colx=out_col)):
            out_sheet.write(rowx, split_pos, value)
        # ... and save the file.
        out_book.save("%s_%03d.xls" % (out_stem, out_num))


if __name__ == "__main__":
    # Usage: pass the input workbook path and the output file stem.
    raj_split(*sys.argv[1:3])
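
The script above splits columns into separate .xls files. For the case described in the question (one CSV per worksheet, driven by the JSON file), a minimal sketch might look like the following. The helper names load_config, column_index and sheets_to_csv are made up for illustration, and treating the "Delete" entries as column letters to drop is only an assumed reading of the config; "Clean" is left alone because the question does not say what it should do. The workbook argument is anything that offers xlrd's sheets(), name, nrows and row_values().

```python
import csv
import json
import os


def load_config(json_path):
    """Return the "file" section of a config shaped like the one in the question."""
    with open(json_path, encoding="utf-8") as handle:
        return json.load(handle)["file"]


def column_index(letter):
    """Convert a spreadsheet column letter ("A", "B", ..., "AA") to a 0-based index."""
    index = 0
    for char in letter.upper():
        index = index * 26 + (ord(char) - ord("A") + 1)
    return index - 1


def sheets_to_csv(book, out_dir, drop_letters=()):
    """Write every sheet of an xlrd-style workbook to <out_dir>/<sheet name>.csv.

    `book` only needs sheets(); each sheet needs name, nrows and row_values(rowx).
    Dropping the `drop_letters` columns is an ASSUMED meaning of "Delete".
    `out_dir` must already exist.
    """
    drop = {column_index(letter) for letter in drop_letters}
    written = []
    for sheet in book.sheets():
        out_path = os.path.join(out_dir, sheet.name + ".csv")
        with open(out_path, "w", newline="", encoding="utf-8") as handle:
            writer = csv.writer(handle)
            for rowx in range(sheet.nrows):
                row = sheet.row_values(rowx)
                # Keep only the columns that are not marked for deletion.
                writer.writerow(
                    [value for colx, value in enumerate(row) if colx not in drop]
                )
        written.append(out_path)
    return written
```

With xlrd installed, one way to call it would be sheets_to_csv(xlrd.open_workbook(cfg["path"]), out_dir, cfg["Delete"].values()), where cfg comes from load_config. Note that xlrd 2.0 and later reads only .xls files, so an .xlsx path like the one in the question needs an older xlrd or a different reader such as openpyxl. Sheet names containing characters that are not valid in file names would also need sanitising first.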