-
Notifications
You must be signed in to change notification settings - Fork 549
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation Stopped After a While #51
Comments
Yes that's expected, once it reaches the max cache size, conv will stop. That should match roughly 5 min of conv. We will try to expand that in the future. |
pruning should work .. i mean llm ctx is removing from the start too and prunes it down .. maybe we can do something like that ? |
Not clear yet if we are going to fix this or not. I'll keep the issue open until we decide :) |
PR #102 should make the MLX conversations ~2x longer (the culprit was that rustymimi only add a max-seq-len of 4096 but the mimi transformer runs at twice the frame rate of the main moshi transformer). |
@LaurentMazare now that the pr is merged, would it be possible to trim all tokens older than n minutes? |
Backend impacted
The MLX implementation
Operating system
Mac OS X
Hardware
Metal with MLX
Description
I received the following stack trace after going through with the conversation (and playing some music), with the following stack trace in the extra information. The conversation stopped after that, with no reply from moshi.
Extra information
Environment
The text was updated successfully, but these errors were encountered: