Checking cache on model group fallbacks #4212
Manouchehri started this conversation in General
Replies: 1 comment
It seems reasonable to check the cache when the cache key changes. We have optimizations around reducing the number of cache GET requests, e.g. batching GETs for a key: https://docs.litellm.ai/docs/proxy/caching#turn-on-batch_redis_requests. I don't view this as a bug, but I'm happy to convert it to a discussion and see what works best!
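To illustrate the idea in the reply above, here is a minimal, hypothetical sketch (not LiteLLM's actual implementation; the class and backend are invented for illustration) of skipping redundant cache GETs within a single request when the cache key is unchanged, while still checking when a fallback changes the key:

```python
import hashlib
import json


class FallbackAwareCache:
    """Hypothetical wrapper: only issue a cache GET once per key per request."""

    def __init__(self, backend):
        self.backend = backend          # any dict-like cache backend (Redis, S3, ...)
        self._checked_keys = set()      # keys already looked up during this request

    def _cache_key(self, model, messages):
        # Derive a deterministic key from the call parameters.
        payload = json.dumps({"model": model, "messages": messages}, sort_keys=True)
        return hashlib.sha256(payload.encode()).hexdigest()

    def get(self, model, messages):
        key = self._cache_key(model, messages)
        if key in self._checked_keys:
            # Same key already missed this request (e.g. a plain retry of the
            # same model): skip the redundant GET.
            return None
        self._checked_keys.add(key)
        # A fallback to a different model produces a different key, so it is
        # still checked, matching the "check when the key changes" behavior.
        return self.backend.get(key)
```

Because the model name feeds into the key, a fallback to another model in the group yields a new key and a fresh lookup, while a straight retry of the same call is deduplicated.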
What happened?
When a call in an LLM model group fails, the cache appears to be checked again on the fallback attempt. This is wasteful, since the repeated lookup is highly unlikely to ever result in a cache hit.
I would assume this is a bug across all caching providers, not just S3.
Relevant log output
No response
Twitter / LinkedIn details
https://www.linkedin.com/in/davidmanouchehri/