You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
Currently the semantic cache uses all-MiniLM-L12-v2 embedding model in semantic cache. This model supports max seq up to 512. This works for short prompts.
For long prompts, other embedding models need to be explored.
Describe the solution you'd like
Document the limitation or support embedding models with longer max seq.