Question

我使用以下代码打开电子表格。

in_book = xlrd.open_workbook(TPS_XLS_File)
in_sheet = in_book.sheet_by_index(0)

当我尝试访问工作表中的行时，读取开始于从电子表格中跳过前23行的行。
command in_sheet.row(1)实际上是返回25行中的值。从25列读取的数据也不正确，错误消息错误：23。

它读取单元格值如下

print(in_sheet.col(2))
[empty:'', empty:'', text:'Table/File', error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23, error:23

请注意：价值＆＃34;文字：＆＃39;表格/文件＆＃39;＆＃34;表示电子表格中的单元格B27，表示读取从B25开始。

我使用了相同的代码来加载超过1000个文件，除了手上满是这个错误似乎失败的文件之外，所有文件都有效。

非常感谢任何帮助信息或建议。

由于阿尼尔

Answer 1

要使用pandas的大文件。您可以使用pandas的ExcelFile parse方法阅读Excel工作表，请参阅io docs：

xls = pd.ExcelFile('C:\Users\cb\Machine_Learning\cMap_Joins.xlsm')

df = xls.parse('Sheet1', skiprows=4, index_col=None, na_values=['NA'])

skiprows将忽略前4行（即从第5行开始）和几个other options。

Python XLRD模块跳过前几行并读取错误列：23

1 个答案: