Use More Embedding Models in Similarity Search for Semantic Cache #106

@rootfs

Description

Is your feature request related to a problem? Please describe.
Currently the semantic cache uses the all-MiniLM-L12-v2 embedding model. This model supports a maximum sequence length of 512 tokens, which works for short prompts.

For long prompts, other embedding models need to be explored.

Describe the solution you'd like
Document the limitation, or support embedding models with a longer maximum sequence length.
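One possible shape for the second option is to route each prompt to the first configured model whose maximum sequence length can hold it. The sketch below is purely illustrative: the long-context model name is a placeholder, and the whitespace token count is a stand-in for the model's real tokenizer.

```python
# Hypothetical sketch: pick an embedding model by prompt length.
# "hypothetical-long-context-model" is a placeholder, not a real model.

EMBEDDING_MODELS = [
    # (model name, max sequence length in tokens)
    ("all-MiniLM-L12-v2", 512),                 # current default in the semantic cache
    ("hypothetical-long-context-model", 8192),  # placeholder alternative
]

def approx_token_count(prompt: str) -> int:
    """Rough estimate; a real implementation would use the model's tokenizer."""
    return len(prompt.split())

def select_model(prompt: str) -> str:
    """Return the first model whose max seq length can hold the whole prompt."""
    tokens = approx_token_count(prompt)
    for name, max_seq in EMBEDDING_MODELS:
        if tokens <= max_seq:
            return name
    # Nothing fits: the prompt will be truncated; use the largest model.
    return EMBEDDING_MODELS[-1][0]

print(select_model("What is the capital of France?"))  # all-MiniLM-L12-v2
print(select_model("word " * 2000))                    # hypothetical-long-context-model
```

Documenting the 512-token truncation would still be useful even with routing, since any prompt longer than the largest configured model gets silently truncated.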

Describe alternatives you've considered

Additional context
#59

Metadata

Assignees

Labels

help wanted (Extra attention is needed), priority/P1 (Important / Should-Have)

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests
