All Prompt Drivers implement the ExponentialBackoffMixin, which means they should automatically back off and retry when a request fails. How is the Groq Driver implemented?
-
It's just using OpenAiChatPromptDriver with a different URL. Is min_retry_delay in seconds? I wonder if I could try setting it to something a bit longer.
-
Hmm. I tried setting it to 10, but I'm not sure that's actually the right fix. The problem is hitting the API too often when agents go back and forth, or when multiple agents are called in parallel. So it's not a call being retried because of an error; it's an agent using another agent (or multiple agents) as tools, which just hits the API too frequently.
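To make the distinction concrete: retry backoff only kicks in after a request has already failed, whereas what's described here needs a minimum gap between requests before they are sent, shared across parallel callers. Here is a rough sketch of that idea in plain Python. `MinIntervalThrottle` and its methods are hypothetical names made up for illustration and are not part of Griptape or the Groq client; where it would hook in (driver or task) is still the open question.

```python
import threading
import time


class MinIntervalThrottle:
    """Hypothetical helper (not a Griptape API): spaces out calls so that
    consecutive calls start at least `min_interval` seconds apart, even
    when they come from several threads at once."""

    def __init__(self, min_interval: float) -> None:
        self.min_interval = min_interval
        self._lock = threading.Lock()
        self._next_allowed = 0.0  # monotonic time at which the next call may start

    def wait(self) -> None:
        # Reserve a start slot under the lock, then sleep outside it so
        # other threads can queue up for their own slots in the meantime.
        with self._lock:
            now = time.monotonic()
            start = max(now, self._next_allowed)
            self._next_allowed = start + self.min_interval
        delay = start - now
        if delay > 0:
            time.sleep(delay)

    def call(self, fn, *args, **kwargs):
        # Block until this call's slot arrives, then run the wrapped function.
        self.wait()
        return fn(*args, **kwargs)


if __name__ == "__main__":
    throttle = MinIntervalThrottle(min_interval=1.0)  # roughly one call per second
    for i in range(3):
        # Stand-in for an API request; a real call would go here.
        throttle.call(print, f"request {i} sent")
```

Because one throttle instance is shared, agents that call other agents as tools would all draw from the same schedule, which is the part that a per-call retry delay doesn't cover.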
-
Some of our ComfyUI users who are using Groq are hitting rate limits. They suggested adding a one-second delay to help stop that from happening. There's a ControlFlowUtility with a wait node that we could use, but it wouldn't work with subtasks, only between agent runs.
How might we add a delay in the framework to avoid rate limits? Is that something that would go on a driver, or should it be part of a task?