Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize Multi-Model Configuration Retrieval Using Parallel Execution #1062

Open
wants to merge 1 commit into
base: feature/multi_model_deployment
Choose a base branch
from

Conversation

mrDzurb
Copy link
Member

@mrDzurb mrDzurb commented Feb 7, 2025

Description

https://jira.oci.oraclecorp.com/browse/ODSC-68526

The get_multimodel_compatible_shapes method currently calls get_config sequentially for each model in the model_ids list, leading to inefficiencies and slow execution when handling multiple models.

This update refactors the method to leverage ThreadPoolExecutor for parallel execution, reducing latency and improving response times when retrieving deployment configurations for multiple models.

Additionally, all logic responsible for loading and processing the model deployment configuration has been moved into a separate class, MultiModelDeploymentConfigLoader. This improves code isolation, making it easier to maintain and test in the future.

@oracle-contributor-agreement oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Feb 7, 2025
@mrDzurb mrDzurb requested a review from lu-ohai February 7, 2025 02:13
Copy link
Member

@VipulMascarenhas VipulMascarenhas left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this is a pretty nice optimization and separation of logic. lgtm 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
OCA Verified All contributors have signed the Oracle Contributor Agreement.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants