Python script to skip the header row in an Excel file

Date: 2013-05-31 14:43:41

Tags: python excel header skip

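I wrote a Python script that pulls Excel files from a folder and writes them into a SQL table. The code works, but only if I delete the first row of the Excel file, which contains the headers. I'm new to Python, so this is probably something simple, but I've looked at a lot of different techniques and can't figure out how to work one into my code. Any ideas would be greatly appreciated!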

# Import modules
from xlrd import open_workbook, cellname
import arcpy
import pyodbc as p

# Database Connection Info
server = "myServer"
database = "my_Tables"
connStr = ('DRIVER={SQL Server Native Client 10.0};SERVER=' + server + ';DATABASE=' + database + ';' + 'Trusted_Connection=yes')

# Assign path to Excel file
file_to_import = '\\\\Location\\Report_Test.xls'

# Assign column count
column_count=10

# Open entire workbook
book = open_workbook(file_to_import)

# Use first sheet
sheet = book.sheet_by_index(0)

# Open connection to SQL Server Table
conn = p.connect(connStr)

# Get cursor
cursor = conn.cursor()

# Assign the query string without values once, outside the loop
query = "INSERT INTO HED_EMPLOYEE_DATA (Company, Contact, Email, Name, Address, City, CentralCities, EnterpriseZones, NEZ, CDBG) VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?)"

# Iterate through each row

for row_index in range(sheet.nrows):

    row_num          = row_index
    Company          = sheet.cell(row_index,0).value
    Contact          = sheet.cell(row_index,1).value
    Email            = sheet.cell(row_index,2).value
    Name             = sheet.cell(row_index,3).value
    Address          = sheet.cell(row_index,4).value
    City             = sheet.cell(row_index,5).value
    CentralCities    = sheet.cell(row_index,6).value
    EnterpriseZones  = sheet.cell(row_index,7).value
    NEZ              = sheet.cell(row_index,8).value
    CDBG             = sheet.cell(row_index,9).value

    values = (Company, Contact, Email, Name, Address, City, CentralCities, EnterpriseZones, NEZ, CDBG)

    cursor.execute(query, values)

# Close cursor
cursor.close()

# Commit transaction
conn.commit()

# Close SQL server connection
conn.close()

2 Answers:

Answer 0 (score: 5)

You can start the iteration at the second row. Try the following:

for row_index in range(1,sheet.nrows):
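
As a minimal sketch of the same idea, the header skip can be wrapped in a small generator. `iter_data_rows` and its `header_rows` parameter are hypothetical names introduced here, not part of xlrd; the function relies only on the `nrows`, `ncols`, and `cell(row, col).value` members that an xlrd sheet exposes.

```python
def iter_data_rows(sheet, header_rows=1):
    """Yield each row's cell values as a tuple, skipping the leading header rows.

    Assumes `sheet` exposes nrows, ncols and cell(row, col).value,
    as an xlrd sheet does. `header_rows` is a hypothetical parameter.
    """
    # Start at header_rows instead of 0 so the header is never yielded
    for row_index in range(header_rows, sheet.nrows):
        yield tuple(sheet.cell(row_index, col).value
                    for col in range(sheet.ncols))
```

In the question's script, something like `for values in iter_data_rows(sheet): cursor.execute(query, values)` should then insert only the data rows, provided the sheet has exactly the ten columns named in the query.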

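Edit: If you need to iterate over a list of .xls files, as you mentioned in the comments, the basic idea is to wrap your code in an outer loop over the files. Here are some hints: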

# You need to import the os library at the beginning of your code
import os

...
# Part of your code here
...

# Assign path to the folder containing the Excel files
#file_to_import = '\\\\Location\\Report_Test.xls'
folder_to_import = '\\\\Location'
l_files_to_import = os.listdir(folder_to_import)
for file_name in l_files_to_import:
    if file_name.endswith('.xls'):
        # os.listdir returns bare file names, so build the full path
        file_to_import = os.path.join(folder_to_import, file_name)
        # The rest of your code here. Be careful with the indentation!
        column_count=10
        ...
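
One detail worth keeping in mind: `os.listdir` returns bare file names rather than full paths, so each name generally has to be joined with the folder before it is passed to `open_workbook`. A minimal sketch follows; `list_xls_files` is a hypothetical helper name, and the case-insensitive extension match and the sorting are assumptions added for illustration.

```python
import os

def list_xls_files(folder):
    """Return full paths of the .xls files directly inside `folder`, sorted by name."""
    return [os.path.join(folder, name)
            for name in sorted(os.listdir(folder))
            # Lower-case the name so 'REPORT.XLS' matches too; '.xlsx' does not match
            if name.lower().endswith('.xls')]
```

The outer loop could then read `for file_to_import in list_xls_files(folder_to_import):`, with the workbook-opening and insert code indented beneath it.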

Answer 1 (score: 0)

Or, if you're lazy:

rows = 0
for row in ws.rows :
    if rows == 0 : 
        rows += 1
        continue
    ...

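The advantage is that you don't have to fuss with complicated cell dereferencing: you have the row object, and you can simply do row[col].value. Neat.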
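
The manual counter can also be avoided. Below is a minimal sketch using the standard library's `itertools.islice`, which works on any iterable of rows; `skip_header` is a hypothetical helper name, and plain lists stand in for worksheet rows. Note that `ws.rows` in this answer resembles the openpyxl worksheet interface, whereas the xlrd sheet in the question is read with `sheet.cell(row, col)`, so this is an illustration of the pattern rather than a drop-in for the question's code.

```python
from itertools import islice

def skip_header(rows, header_rows=1):
    """Return an iterator over `rows` that skips the first `header_rows` items."""
    # islice(iterable, start, None) lazily drops the first `start` items
    return islice(rows, header_rows, None)

# Plain lists stand in for worksheet row objects (illustrative data only)
sample_rows = [["Company", "Contact"], ["Acme", "Ann"], ["Bolt", "Bob"]]
for row in skip_header(sample_rows):
    print(row)
```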