ENH creating a memmaping_reusable_executor #286

tomMoral · 2021-02-16T19:40:57Z

Some people would like to benefit from joblib's memmapping serialization while still retaining the flexibility of the asynchronous computations offered by the concurrent.futures API. This issue is intended to keep track of ideas for this feature.

We could expose a memmaping_reusable_executor that would use joblib's serialization with the classical concurrent.futures API. This would require some tweaking to correctly keep track of the memmap files. Talking with @pierreglaser IRL, he proposed the following strategy to keep track of the memmaps:

- Add references to the memmaps in each future, releasing them only once the future is completed.
- Never return memmapped objects.

This way, memmaps are only created when sending tasks to the pool of workers, and once the futures that depend on them are done, no other process should need them.
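To make the bookkeeping concrete, here is a minimal sketch of that strategy using only the standard library. It is not joblib's or loky's implementation: `submit_with_memmap` and `sum_mapped_bytes` are hypothetical names, the payload is plain bytes rather than a numpy array, and a thread pool stands in for the process-based executor so the example stays self-contained.

```python
import mmap
import os
import tempfile
from concurrent.futures import Executor, Future, ThreadPoolExecutor

CHUNK = 1 << 20  # read the mapping 1 MiB at a time


def sum_mapped_bytes(path: str) -> int:
    """Task body: map the file read-only and return a plain int, never the memmap."""
    total = 0
    with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as m:
        for start in range(0, len(m), CHUNK):
            total += sum(m[start:start + CHUNK])
    return total


def submit_with_memmap(executor: Executor, func, payload: bytes) -> Future:
    """Hypothetical helper: back a non-empty `payload` with a temp file for one task."""
    fd, path = tempfile.mkstemp(suffix=".mmap")
    with os.fdopen(fd, "wb") as f:
        f.write(payload)
    try:
        future = executor.submit(func, path)
    except BaseException:
        os.unlink(path)  # submission failed, so nothing will ever need the file
        raise
    # The future carries the reference to its backing file...
    future.memmap_path = path
    # ...and the file is released only once that future is done
    # (finished, failed, or cancelled).
    future.add_done_callback(lambda fut: os.unlink(path))
    return future


def demo() -> tuple:
    payload = bytes(range(256)) * 4
    with ThreadPoolExecutor(max_workers=2) as executor:
        future = submit_with_memmap(executor, sum_mapped_bytes, payload)
        result = future.result()
    # Leaving the `with` block waits for the worker, so the callback has run.
    return result, os.path.exists(future.memmap_path)
```

Calling `demo()` returns the byte sum together with a flag showing whether the backing file still exists after the pool has shut down. A real implementation would also have to handle memmaps shared by several futures, for instance with a reference count per file, which this sketch leaves out.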

jakirkham · 2023-07-06T02:03:44Z

Building on the discussion in joblib/joblib#705, I wonder whether it would be worth using SharedMemoryManager, available in Python 3.8+. There is also a backport package for Python 3.6 and 3.7.
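For reference, a minimal sketch of what `SharedMemoryManager` offers, assuming Python 3.8+ and the standard library only. The function name `roundtrip_through_shared_memory` is hypothetical, and the "worker" attach happens in the same process purely to keep the example short; in an executor, only the segment name would be sent to the worker.

```python
from multiprocessing import shared_memory
from multiprocessing.managers import SharedMemoryManager


def roundtrip_through_shared_memory(payload: bytes) -> bytes:
    """Copy a non-empty `payload` into a managed segment and read it back by name."""
    n = len(payload)
    with SharedMemoryManager() as smm:
        # The manager tracks the segment and unlinks it when the block exits.
        shm = smm.SharedMemory(size=n)
        try:
            shm.buf[:n] = payload
            # What a worker would do: attach to the existing segment by name.
            view = shared_memory.SharedMemory(name=shm.name)
            try:
                # Slice by n: the segment may be rounded up to a page size.
                return bytes(view.buf[:n])
            finally:
                view.close()
        finally:
            shm.close()
```

The manager gives the same kind of lifetime guarantee discussed above: segments live as long as the manager is running and are released when it shuts down, which could take the place of tracking temporary files by hand.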

jakirkham mentioned this issue Jul 6, 2023

Using shm_open joblib/joblib#705

Open
