Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parallel CV #283

Open
rth opened this issue Jun 25, 2021 · 1 comment
Open

Parallel CV #283

rth opened this issue Jun 25, 2021 · 1 comment

Comments

@rth
Copy link
Collaborator

rth commented Jun 25, 2021

It would be interesting if it was possible to run cross-validation in parallel. This was also requested by @zhangJianfeng in #250 (comment)

There are two use-case here,

  • local training. E.g. for scikit-learn models this would most often be faster
  • submissions on the server, where currently resources are not optimally used. For instance to avoid CPU oversubsciption we reserve some number of CPU for each worker (via CPU affinity). Then for submissions that don't use multi-processing or threading this results in unused resources. Even for submissions that have some level of parallelism via BLAS for parts of the code, running cross-validation in parallel would likely be an improvement.

There are two potential issues,

In any case having this as a CLI option (disabled by default) for ramp-test could be a start.

@albertcthomas
Copy link
Collaborator

Thanks for starting the discussion @rth. This would indeed be a very nice feature.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants