Python(openpyxl):将数据从一个excel文件放到另一个(模板文件)&在保留模板的同时用另一个名称保存它

时间:2018-04-02 11:58:18

标签: python excel python-3.x openpyxl shutil

我有一个名为template.xlsx 模板 excel文件,其中有多张工作表。我想将单独的.csv文件中的数据复制到第一张template.xlsx(名为 data )并将新文件另存为保留原始模板文件时result.xlsx

我想从data

template.xlsx 表格中的第二行开始粘贴数据

这是我目前开发的代码

import pandas as pd
from openpyxl.utils.dataframe import dataframe_to_rows
import openpyxl
from shutil import copyfile

template_file = 'template.xlsx' # Has a header in row 1 already which needs to be skipped while pasting data but it should be there in the output file
output_file = 'result.xlsx' 

copyfile(template_file, output_file)
df = pd.read_csv('input_file.csv') #The file which is to be pasted in the template

wb = openpyxl.load_workbook(output_file)
ws = wb.get_sheet_by_name('data') #Getting the sheet named as 'data'

for r in dataframe_to_rows(df, index=False, header=False):
   ws.append(r)

 wb.save(output_file)

我无法获得所需的输出

左侧的模板文件(带有一个额外的行)和右侧的输入文件(要复制到模板的数据),如下所示

template enter image description here

1 个答案:

答案 0 :(得分:3)

实际上并不需要使用shutil模块,因为您可以使用openpyxl.load_workbook加载模板,然后使用其他名称进行保存。

此外,你在for循环中的ws.append(r)将附加到从template.xlsx中获取的现有数据,听起来你只想保留标题。

我在下面提供了一个完全可重现的示例,用于演示目的创建“template.xlsx”。然后加载'template.xlsx'向其添加新数据并保存为result.xlsx。

from openpyxl import Workbook
from openpyxl import load_workbook
from openpyxl.utils.dataframe import dataframe_to_rows
from openpyxl.chart import PieChart, Reference, Series
import pandas as pd

template_file = 'template.xlsx'
output_file = 'result.xlsx'

#This part creates a workbook called template.xlsx with a sheet called 'data' and sheet called 'second_sheet'
writer = pd.ExcelWriter('template.xlsx', engine='openpyxl') 
wb  = writer.book
df = pd.DataFrame({'Pie': ["Cream", "Cherry", "Banoffee", "Apple"],
                  'Sold': [2, 2, 1, 4]})

df.to_excel(writer, index=False, sheet_name='data', startrow=1)
ws = writer.sheets['data']
ws['A1'] = 1
ws['B1'] = 2

ch = PieChart()
labels = Reference(ws, min_col=1, min_row=3, max_row=6)
data = Reference(ws, min_col=2, min_row=3, max_row=6)
ch.series = (Series(data),)
ch.title = "Pies sold"
ws.add_chart(ch, "D2")

ws = wb.create_sheet("Second_sheet")
ws['A1'] = 'This Sheet will not be overwitten'

wb.save(template_file)

#Now we load workbook called template.xlsx modify the 'data' sheet and save under a new name
#template.xlsx has not been modified

df_new = pd.DataFrame({'different_name': ["Blueberry", "Pumpkin", "Mushroom", "Turnip"],
                  'different_numbers': [4, 6, 2, 1]})

wb = load_workbook(template_file)

ws = wb.get_sheet_by_name('data') #Getting the sheet named as 'data'

rows = dataframe_to_rows(df_new, index=False, header=False)

for r_idx, row in enumerate(rows, 1):
    for c_idx, value in enumerate(row, 1):
         ws.cell(row=r_idx+2, column=c_idx, value=value)

wb.save(output_file)

预期产出:

Expected Output for the two workbooks