Question

我正在使用pandas使用HDFStore加入几个巨大的csv文件。我将所有其他表合并到基表base。现在我在HDFStore中为每个合并的输出创建一个新表，我称之为temp。然后我删除旧的基表。最后，我将temp复制到base，然后在我需要加入的下一个表格上重新开始这个过程。

如果我可以简单地将temp重命名为base，那么效率会更高。这可能吗？

Answer 1

是的，有可能。您必须深入研究HDFStore所依赖的methods from PyTables。

Out[20]: 
<class 'pandas.io.pytables.HDFStore'>
File path: test.h5
/a            frame        (shape->[3,1])

In [21]: store.get_node('a')._f_rename('b')

In [22]: store
Out[22]: 
<class 'pandas.io.pytables.HDFStore'>
File path: test.h5
/b            frame        (shape->[3,1])

相同的方法适用于frame_table可附加节点。

在pandas hdfstore中重命名表

1 个答案: