Saving a dask dataframe in bcolz format

Time: 2016-07-11 20:19:23

Tags: python dask bcolz

The dask documentation states: "BColz is an on-disk, chunked, compressed, column-store. These attributes make it very attractive for dask.dataframe which can operate particularly well on it. There is a special from_bcolz function."

However, I could not find an example of how to save a dask dataframe to bcolz. What is the recommended way to do this?

1 answer:

Answer 0 (score: 1)

I created a pull request to implement this. Until it is merged into the master branch, you can find it here:

https://github.com/dask/dask/pull/1386

If you don't want to edit your own Dask installation, you can simply copy the TConfigData method.