Parallel CV #283

rth · 2021-06-25T08:23:37Z

It would be interesting if it was possible to run cross-validation in parallel. This was also requested by @zhangJianfeng in #250 (comment)

There are two use-case here,

local training. E.g. for scikit-learn models this would most often be faster
submissions on the server, where currently resources are not optimally used. For instance to avoid CPU oversubsciption we reserve some number of CPU for each worker (via CPU affinity). Then for submissions that don't use multi-processing or threading this results in unused resources. Even for submissions that have some level of parallelism via BLAS for parts of the code, running cross-validation in parallel would likely be an improvement.

There are two potential issues,

currently using TensorFlow with joblib will results in CPU oversubscription because threadpoolctl is not able to limit the number of threads used Limiting threads in TensorFlow joblib/threadpoolctl#84
as mentioned by @albertcthomas some models might not be picklable

In any case having this as a CLI option (disabled by default) for ramp-test could be a start.

The text was updated successfully, but these errors were encountered:

albertcthomas · 2021-06-25T09:23:04Z

Thanks for starting the discussion @rth. This would indeed be a very nice feature.

Provide feedback