我有一个名为template.xlsx
的 模板 excel文件,其中有多张工作表。我想将单独的.csv
文件中的数据复制到第一张template.xlsx
(名为 data
)并将新文件另存为保留原始模板文件时result.xlsx
。
我想从data
template.xlsx
表格中的第二行开始粘贴数据
这是我目前开发的代码
import pandas as pd
from openpyxl.utils.dataframe import dataframe_to_rows
import openpyxl
from shutil import copyfile
template_file = 'template.xlsx' # Has a header in row 1 already which needs to be skipped while pasting data but it should be there in the output file
output_file = 'result.xlsx'
copyfile(template_file, output_file)
df = pd.read_csv('input_file.csv') #The file which is to be pasted in the template
wb = openpyxl.load_workbook(output_file)
ws = wb.get_sheet_by_name('data') #Getting the sheet named as 'data'
for r in dataframe_to_rows(df, index=False, header=False):
ws.append(r)
wb.save(output_file)
我无法获得所需的输出
左侧的模板文件(带有一个额外的行)和右侧的输入文件(要复制到模板的数据),如下所示
答案 0 :(得分:3)
实际上并不需要使用shutil模块,因为您可以使用openpyxl.load_workbook加载模板,然后使用其他名称进行保存。
此外,你在for循环中的ws.append(r)
将附加到从template.xlsx中获取的现有数据,听起来你只想保留标题。
我在下面提供了一个完全可重现的示例,用于演示目的创建“template.xlsx”。然后加载'template.xlsx'向其添加新数据并保存为result.xlsx。
from openpyxl import Workbook
from openpyxl import load_workbook
from openpyxl.utils.dataframe import dataframe_to_rows
from openpyxl.chart import PieChart, Reference, Series
import pandas as pd
template_file = 'template.xlsx'
output_file = 'result.xlsx'
#This part creates a workbook called template.xlsx with a sheet called 'data' and sheet called 'second_sheet'
writer = pd.ExcelWriter('template.xlsx', engine='openpyxl')
wb = writer.book
df = pd.DataFrame({'Pie': ["Cream", "Cherry", "Banoffee", "Apple"],
'Sold': [2, 2, 1, 4]})
df.to_excel(writer, index=False, sheet_name='data', startrow=1)
ws = writer.sheets['data']
ws['A1'] = 1
ws['B1'] = 2
ch = PieChart()
labels = Reference(ws, min_col=1, min_row=3, max_row=6)
data = Reference(ws, min_col=2, min_row=3, max_row=6)
ch.series = (Series(data),)
ch.title = "Pies sold"
ws.add_chart(ch, "D2")
ws = wb.create_sheet("Second_sheet")
ws['A1'] = 'This Sheet will not be overwitten'
wb.save(template_file)
#Now we load workbook called template.xlsx modify the 'data' sheet and save under a new name
#template.xlsx has not been modified
df_new = pd.DataFrame({'different_name': ["Blueberry", "Pumpkin", "Mushroom", "Turnip"],
'different_numbers': [4, 6, 2, 1]})
wb = load_workbook(template_file)
ws = wb.get_sheet_by_name('data') #Getting the sheet named as 'data'
rows = dataframe_to_rows(df_new, index=False, header=False)
for r_idx, row in enumerate(rows, 1):
for c_idx, value in enumerate(row, 1):
ws.cell(row=r_idx+2, column=c_idx, value=value)
wb.save(output_file)
预期产出: