Respect retry-after header from Azure openAI when ratelimited #6686
JonathanVelkeneers
started this conversation in
Ideas
Replies: 1 comment
-
I'm surprised this is not implemented. The use of exponential backoff creates a lot of unnecessary calls and is not effective time-wise |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Checked
Feature request
Azure openAI passes a 'retry-after' header when hitting a ratelimit.
Use this timeout to wait between retries of the ChatModel
Motivation
Currently the retry logic uses an exponential backoff to handle retries.
This works, but fills the console with ratelimit errors. This can be optimized by using the provided wait time from the 'retry-after' header.
Example response:
Proposal (If applicable)
No response
Beta Was this translation helpful? Give feedback.
All reactions