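Over the past few years there have been several posts about parallelizing pandas.apply(), as well as posts describing problems that could be solved by structuring the data as a DataFrame and using pandas.apply(), if parallelization were implemented.

My question to the community of experts here: what is the status of this feature? R already has mclapply.

As of now there is no clean, standard solution. Rewriting entire functions and scripts to use the suggested workarounds is very tedious.

Related posts: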
Python Pandas Multiprocessing Apply
Parallelize apply after pandas groupby
Parallel and Multicore Processing in R
Python multiprocessing pool.map for multiple arguments
passing kwargs with multiprocessing.pool.map
passing arguments and manager.dict to pool in multiprocessing in python 2.7
Is there a simple process-based parallel map for python?
Pandas with rpy2 and multiprocessing
How to asynchronously apply function via Spark to subsets of dataframe?
Efficiently applying a function to a grouped pandas DataFrame in parallel
python dask DataFrame, support for (trivially parallelizable) row apply?
Python multiprocessing job to Celery task but AttributeError
Parallelizing apply function in pandas python. worked on groupby
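
To illustrate what such a workaround involves, here is a minimal sketch, assuming the common pattern of splitting the DataFrame into chunks, running the ordinary pandas apply on each chunk in a `multiprocessing.Pool`, and concatenating the results. The names `row_func`, `split_frame`, `apply_chunk` and `parallel_apply` are hypothetical helpers made up for this example, not part of pandas, and the per-row function is only a placeholder.

```python
import multiprocessing as mp

import numpy as np
import pandas as pd


def row_func(row):
    # Hypothetical per-row computation; stands in for the real function.
    return row["a"] + row["b"]


def split_frame(df, n_chunks):
    # Split the frame by position into n_chunks contiguous, roughly equal pieces.
    bounds = np.linspace(0, len(df), n_chunks + 1, dtype=int)
    return [df.iloc[start:stop] for start, stop in zip(bounds[:-1], bounds[1:])]


def apply_chunk(chunk):
    # Run the ordinary, serial pandas apply on one chunk of rows.
    return chunk.apply(row_func, axis=1)


def parallel_apply(df, n_workers=2):
    # One task per chunk; each worker process receives a pickled copy of its chunk.
    chunks = split_frame(df, n_workers)
    with mp.Pool(n_workers) as pool:
        parts = pool.map(apply_chunk, chunks)
    # pool.map preserves input order, so concatenation restores the row order.
    return pd.concat(parts)


if __name__ == "__main__":
    df = pd.DataFrame({"a": range(8), "b": range(8, 16)})
    result = parallel_apply(df)
    # Compare against the plain serial apply.
    print(result.equals(df.apply(row_func, axis=1)))
```

Even in this small form, the function has to be defined at module level so it can be pickled, the script needs a `__main__` guard, and every chunk is copied to a worker process, which is the kind of restructuring that makes these workarounds tedious to retrofit onto existing scripts.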