fix(virtual-quota): display active model in UI for the frontend #2893
base: main
🦋 Changeset detected
Latest commit: a3d9802
The changes in this PR will be included in the next version bump. This PR includes changesets to release 1 package.
Hey Matt @mcowger, would you mind testing this PR? Tagging you here because you are the creator of the feature.
Will give it a shot tomorrow. Thanks for the ping.
I took a try with this. It does appear to switch the displayed information for the active model in the task timeline view, but only for the first change.
E.g., if I configured:
- qwen3-coder-plus (128K)
- gemini-2.5-flash (1M)
- gpt-5-codex (400K)
- qwen3-coder (262K)
The context timeline at the top shifts from 128K to 1M as expected on the first transition. Subsequent transitions do occur (in that the request is sent to the correct profile), but the state of the context tracker doesn't change and remains at 1M context.
@mcowger Screen.Recording.2025-10-15.at.06.24.37.mov
Very strange. I'll give it another try tonight and see if I can grab some logs.
I was wrong - I think I may have been testing in the wrong window. Confirmed that both the context window tracker and the quick picker by the chat window change as expected. This also appears to fix the context condensing bug. Well done! The only thing I think you need here is the @chrarnoldus this works well for me!
Well, I was actually working on the context condensing bug at first, then I thought, why not also fix the UI.
@chrarnoldus any chance this will be merged?
Co-authored-by: Christiaan Arnoldus <[email protected]>
Context
Help the user see which model is currently active when using the virtual quota fallback provider, with the correct model name and the correct context window.
Implementation
When the backend switches the model, it now emits a "model has changed" event.
The main application logic listens for this event and immediately tells the user interface to refresh itself.
The user interface then updates its display to show the name of the new, currently active model.
This also keeps the active model in sync between the backend and the frontend.
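The flow described above can be sketched roughly as follows. This is only an illustration of the emit-and-refresh pattern, not the code in this PR: `FallbackHandler`, `UiState`, `onModelChanged`, and `switchTo` are hypothetical names, and the context window sizes are approximate figures taken from the review comment.

```typescript
// Hypothetical shapes; none of these names come from the actual PR.
interface ActiveModel {
	id: string
	contextWindow: number
}

type ModelChangedListener = (model: ActiveModel) => void

// Stand-in for the virtual quota fallback backend.
class FallbackHandler {
	private listeners: ModelChangedListener[] = []
	private active: ActiveModel

	constructor(private readonly profiles: ActiveModel[]) {
		const first = profiles[0]
		if (first === undefined) throw new Error("at least one profile is required")
		this.active = first
	}

	// Register a callback for the "model has changed" signal.
	onModelChanged(listener: ModelChangedListener): void {
		this.listeners.push(listener)
	}

	getActiveModel(): ActiveModel {
		return this.active
	}

	// Switch profiles and notify listeners, but only on a real change.
	switchTo(id: string): void {
		const next = this.profiles.find((p) => p.id === id)
		if (next === undefined || next.id === this.active.id) return
		this.active = next
		for (const listener of this.listeners) listener(next)
	}
}

// Stand-in for the frontend state that shows the active model.
class UiState {
	displayedModel: ActiveModel | undefined
	refreshCount = 0

	refresh(model: ActiveModel): void {
		this.displayedModel = model
		this.refreshCount += 1
	}
}

const handler = new FallbackHandler([
	{ id: "qwen3-coder-plus", contextWindow: 128_000 },
	{ id: "gemini-2.5-flash", contextWindow: 1_000_000 },
	{ id: "gpt-5-codex", contextWindow: 400_000 },
])
const ui = new UiState()

// Initial render, then refresh on every model change.
ui.refresh(handler.getActiveModel())
handler.onModelChanged((model) => ui.refresh(model))

handler.switchTo("gemini-2.5-flash") // first transition
handler.switchTo("gpt-5-codex") // later transitions must also reach the UI
```

Because the UI is driven by the event rather than by a one-time read, every transition (not just the first) updates both the displayed name and the context window.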
Screenshots
How to Test
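Select virtual quota fallback as the provider, then check that the displayed model name and context window update whenever the backend switches models.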
Get in Touch
ivanarifin