Ideas for different scheduling strategies #628

Open
philwinder opened this issue Dec 5, 2024 · 1 comment
Labels
enhancement New feature or request scheduler Issues relating to the scheduler

Comments

@philwinder
Contributor

  1. Imagine the scheduler is under constant load from a single model type. Then a user comes in with a request for another model type. That request will never get scheduled.

  2. Imagine prod, where we have a lot of machines. It's annoying that image models are constantly evicted to make room for text models, because reloading them takes a while. It would be great if we could pin models.

3... more?
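
A rough sketch of how the first two ideas could look, purely for discussion. This is not the existing scheduler code: `Request`, `pick_next`, `pick_eviction`, the `max_wait` threshold, and the model names are all hypothetical. It assumes starvation is handled by aging (a request that has waited too long jumps ahead of requests for the warm model) and that pinning simply excludes a model from eviction candidates.

```python
from dataclasses import dataclass


@dataclass
class Request:
    model: str          # hypothetical model-type identifier
    enqueued_at: float  # time the request entered the queue, in seconds


def pick_next(queue, loaded_model, now, max_wait=30.0):
    """Pick the next request to run and remove it from the queue.

    Requests for the already-loaded model are preferred (no reload cost),
    unless the oldest request has waited at least max_wait seconds, in
    which case it is served first so it cannot starve.
    """
    if not queue:
        return None
    oldest = min(queue, key=lambda r: r.enqueued_at)
    if now - oldest.enqueued_at >= max_wait:
        chosen = oldest  # aging: the starved request wins
    else:
        warm = [r for r in queue if r.model == loaded_model]
        chosen = min(warm, key=lambda r: r.enqueued_at) if warm else oldest
    queue.remove(chosen)
    return chosen


def pick_eviction(loaded, pinned):
    """Return the least recently used model that is not pinned.

    loaded maps model name -> last-used timestamp; pinned is a set of
    model names that must stay resident. Returns None if every loaded
    model is pinned.
    """
    candidates = [m for m in loaded if m not in pinned]
    if not candidates:
        return None
    return min(candidates, key=lambda m: loaded[m])
```

With this shape, a lone image request queued behind a steady stream of text requests is served once it has waited `max_wait` seconds, and a pinned image model is never chosen for eviction even if it is the least recently used. The trade-off is that aging forces an occasional model swap under load, so `max_wait` would need tuning against load times.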

@philwinder philwinder added enhancement New feature or request scheduler Issues relating to the scheduler labels Dec 5, 2024
@philwinder
Contributor Author

Related to #602
