脚本将多个csv合并为单个xslx无法正常工作

时间:2016-11-25 20:16:02

标签: python csv xlsx

已经阅读了所有的主题,但我还是死路一条。只是尝试将所有csv放在一个目录中,并将它们作为新工作表添加到新的xlsx工作簿中。这就是我所拥有的:

import xlwt, csv, os, glob

def make_excel_workbook(path):
    wb = xlwt.Workbook()
    for filename in os.listdir(folder_path):
        if filename.endswith('.csv'):
            ws = wb.add_sheet(os.path.splitext(filename)[0])
            with open('{}\\{}'.format(folder_path, filename), 'rb') as csvfile:
                reader = csv.reader(csvfile, delimiter=',')
                for rowx, row in enumerate(reader):
                    for colx, value in enumerate(row):
                        ws.write(rowx, colx, value)
    return wb

csvDir = "C:\\Temp\\Data\\outfiles"
outDir = "C:\\Temp\\Data\\output"

os.chdir(csvDir)
csvFileList = []
searchTerm = "character string"

for file in glob.glob('*.csv'):
    csvFileList.append(file)

for i in csvFileList: # search a set of extant csv files for a string and make new csv files filtered on the search term
    csv_file = csv.reader(open(i, 'rb'), delimiter=',')
    rowList = []
    for row in csv_file:
        for field in row:
            if searchTerm in field:
                rowList.append(row)
    outputCsvFile = os.path.join(rootDir, i)
    with open(outputCsvFile, 'wb') as newCsvFile:
        wr = csv.writer(newCsvFile, quoting=csv.QUOTE_ALL)
        wr.writerows(rowList)

到目前为止,它可以工作,并从原始的,更大的csv文件中创建新的csv文件。这是它破裂的地方:

if __name__ == '__main__':
    xls = make_excel_workbook(outDir)
    xls_name = "My_Team_Tasks"
    xls.save('{}\\{}{}.'format(outDir, xls_name, '.xls'))
    print('{}\\{}{} saved successfully'.format(outDir, xls_name, '.xls'))

当它到达xls.save时,它会给我以下错误:

更新:这里是整个追溯:

Traceback (most recent call last):
    File"M:/Testing/scripts/csv_parse.py", line 44, in <module>
        xls.save('{}\\{}{}'.format(rootDir, xls_name, '.xls'))
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 696, in save
        doc.save(filename_or_stream, self.get_biff_data())
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 660, in get_biff_data
        shared_str_table   = self.__sst_rec()
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\Workbook.py", line 662, in __sst_rec
        return self.__sst.get_biff_record()
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\BIFFRecords.py", line 77, in get_biff_record
        self._add_to_sst(s)
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\BIFFRecords.py", line 92, in _add_to_sst
        u_str = upack2(s, self.encoding)
    File "C:\Python27\ArcGIS10.4\lib\site-packages\xlwt\UnicodeUtils.py", line 50, in upack2
        us = unicode(s, encoding)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc2 in position 69: ordinal not in range (128)

1 个答案:

答案 0 :(得分:0)

您知道输入CSV文件是如何编码的吗?它从错误消息中显示为unicode?

您可以尝试:

 df <- data.frame(x,var,es)

 df <- melt(df, id.vars = "x", measure.vars = c("var", "es"))

 ggplot(df, aes(x = x, y = value)) + 
      geom_line(aes(group = variable, color = variable)) +
      scale_colour_manual(values=c("red","blue"))

如果没有这个,根据这个答案(xlwt module - saving xls unicode error),解决此问题的另一种可能方法是在写出之前将文本编码为unicode。

wb = xlwt.Workbook(encoding='utf-8')

同样,这取决于输入的编码方式。