Multiple LiteLLM instances #2032

olad32 · 2024-02-17T13:59:39Z

olad32
Feb 17, 2024

Hi, is this possible to run multiple instances in parallel to be able to scale horizontaly ? Even with the database features in use ?
Is there anything to know to be able to rollout a new LiteLLM config on multiple instances without downtime ? Eg update model config
Thanks

krrishdholakia · 2024-02-17T16:06:35Z

krrishdholakia
Feb 17, 2024
Maintainer

Hey @olad32 you should be able to do this. I believe you'd need to change the db connection pool limit for a single instance, to do this well (we currently set it to 100, which could be the max for some systems).

We have a db table for the LLM config that we were planning on using for this.

A problem I was trying to figure out was:

models need keys. Should the table store the keys?

0 replies

krrishdholakia · 2024-02-17T16:07:25Z

krrishdholakia
Feb 17, 2024
Maintainer

If you have time today, would love to do a quick call and talk through this:

https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

0 replies

krrishdholakia · 2024-02-18T03:06:36Z

krrishdholakia
Feb 18, 2024
Maintainer

@olad32 just pushed a fix to let you control db connection pool + timeouts for better scalability - https://docs.litellm.ai/docs/proxy/configs#configure-db-pool-limits--connection-timeouts

Should be out in the next release.

Would love to do a quick call and talk through the reload config file issue - https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Let me know if any time this / next week works!

0 replies

olad32 · 2024-02-19T16:31:02Z

olad32
Feb 19, 2024
Author

Thanks for the configurable pool options.
Regarding the config reload topic, the static config.yaml is the simplest and most performant option for sure, but it would add flexibility to have an option in litellm proxy to be able to update config at runtime. Persisting the config in a database is one solution, litellm proxy could periodically check for updated config in the database, this would not alter performance too much.
As a side note I just saw that the proxy API has a new model endpoint, if this only update the static config in memory, it will not work in multi litellm proxy instance environnement behind a Load Balancer.

For now another option exists via the rolling update kuberntes strategy which can recreate each pod (ie litellm proxy instance) one by one with the new static config.yaml, but it generates a bit of noise on the cluster (recreate each pod), not ideal but manageable. One thing is mandatory for this option to work, litellm proxy must handle gracefull shutdown, ie handle sigterm signal sent by kubernetes and wait for current requests to end before effectively shutting down, especially usefull for long streaming response. In fact gracefulll shutdown is always a good thing to handle.

4 replies

olad32 Feb 19, 2024
Author

(i can't find a schedule to make a call as we are obviously at the earth opposite ;) )

ishaan-jaff Feb 19, 2024
Maintainer

@olad32 we're on this call if you're free right now: https://meet.google.com/zav-wexx-tbe

krrishdholakia Feb 26, 2024
Maintainer

Hi @olad32 i guess the question was:

if you do /config/update and add a new model to the config, would that be stored in the db?

Curious - what're you trying to update about litellm proxy? (is it just adding new models?)

olad32 Feb 28, 2024
Author

Hi @krrishdholakia, exactly, i want to be able to update existing models parameters (like rpm per model) and add new models, and the new configuration must be applied by all litellm proxy instances that runs in parrallel. This is needed to be able to auto scale LiteLLM itself and LLMs instances upstream (add a new llm instance).

ishaan-jaff · 2024-02-20T18:04:24Z

ishaan-jaff
Feb 20, 2024
Maintainer

Hi @olad32 just wanted to follow up

If we add the ability to add new models through the Admin UI, would you be able to try it and give us feedback ?
I'd love to set up a direct 1:1 support channel for you, feel free to pick the channel that works best for you

Linkedin https://www.linkedin.com/in/reffajnaahsi/
Twitter: https://twitter.com/ishaan_jaff
Discord: https://discord.com/invite/wuPM9dRgDw
Other - please let me know if you'd prefer another channel

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multiple LiteLLM instances #2032

{{title}}

Replies: 5 comments 4 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

Multiple LiteLLM instances #2032

olad32 Feb 17, 2024

Replies: 5 comments · 4 replies

krrishdholakia Feb 17, 2024 Maintainer

krrishdholakia Feb 17, 2024 Maintainer

krrishdholakia Feb 18, 2024 Maintainer

olad32 Feb 19, 2024 Author

olad32 Feb 19, 2024 Author

ishaan-jaff Feb 19, 2024 Maintainer

krrishdholakia Feb 26, 2024 Maintainer

olad32 Feb 28, 2024 Author

ishaan-jaff Feb 20, 2024 Maintainer

olad32
Feb 17, 2024

Replies: 5 comments 4 replies

krrishdholakia
Feb 17, 2024
Maintainer

krrishdholakia
Feb 17, 2024
Maintainer

krrishdholakia
Feb 18, 2024
Maintainer

olad32
Feb 19, 2024
Author

olad32 Feb 19, 2024
Author

ishaan-jaff Feb 19, 2024
Maintainer

krrishdholakia Feb 26, 2024
Maintainer

olad32 Feb 28, 2024
Author

ishaan-jaff
Feb 20, 2024
Maintainer