如何使用openpyxl过滤列数据

时间:2018-08-21 07:18:12

标签: python openpyxl

我正在尝试将过滤器应用于现有的Excel文件,并将其导出到另一个Excel文件。我想提取仅包含值16的行,然后将表导出到另一个excel文件(如下图所示)。

我曾尝试多次阅读openpyxl文档并使用Google搜索解决方案,但我仍然无法使我的代码正常工作。我还随附了

下的代码和文件
import openpyxl
# Is use to create a reference of the Excel to wb
 wb1 = openpyxl.load_workbook('test_data.xlsx')
 wb2 = openpyxl.load_workbook('test_data_2.xlsx')

# Refrence the workbook to the worksheets
 sh1 = wb1["data_set_1"]
 sh2 = wb2["Sheet1"]

 sh1.auto_filter.ref = "A:A"
 sh1.auto_filter.add_filter_column(0, ["16"])
 sh1.auto_filter.add_sort_condition("B2:D6")

 sh1_row_number = sh1.max_row
 sh1_col_number = sh1.max_column

 rangeSelected = []
 for i in range(1, sh1_row_number+1, 1):
     rowSelected = []
     for j in range(1, sh1_col_number+1, 1):
         rowSelected.append(sh1.cell(row = i, column = j))
     rangeSelected.append(rowSelected)

  del rowSelected

 for i in range(1, sh1_row_number+1, 1):
    for j in range(1, sh1_col_number+1, 1):
        sh2.cell(row = i, column = j).value = rangeSelected[i-1][j-1].value

 wb1.save("test_data.xlsx")
 wb2.save("test_data_2.xlsx")

The pictures shows what should be the desire result

1 个答案:

答案 0 :(得分:1)

自动过滤器实际上并不过滤数据,仅用于可视化。 您可能想在遍历工作簿时进行过滤。请注意,使用此代码,我假定您在第二个工作簿中已经有表头。它不会覆盖数据,而是会附加到表中。

import openpyxl
# Is use to create a reference of the Excel to wb
wb1 = openpyxl.load_workbook('test_data.xlsx')
wb2 = openpyxl.load_workbook('test_data_2.xlsx')

# Refrence the workbook to the worksheets
sh1 = wb1["data_set_1"]
sh2 = wb2["data_set_1"]   # use same sheet name, different workbook

for row in sh1.iter_rows():
    if row[0].value == 16:   # filter on first column with value 16
        sh2.append((cell.value for cell in row))     

wb1.save("test_data.xlsx")
wb2.save("test_data_2.xlsx")