mrl_and_binary_embeddings

Combining Matryoshka embeddings and binary embeddings for more scalable search.

Details

MRL - Matryoshka Representation Learning is a training approach that yields representations/vectors of varying dimensions instead of a single fixed-dimension vector. It does this by packing the most important information into the leading dimensions rather than spreading it evenly across the whole vector, so the first k dimensions of an embedding can be used as a smaller embedding on their own.
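A minimal sketch of how nested representations can be read off a single vector, assuming the embeddings come from an MRL-trained model. The random matrix `full` only stands in for real model output, and `truncate_embedding` is a hypothetical helper name, not part of any library. Truncating the output of a model that was not trained with MRL would generally lose far more quality.

```python
import numpy as np

def truncate_embedding(embedding: np.ndarray, dim: int) -> np.ndarray:
    """Keep the first `dim` coordinates and L2-normalize the result."""
    truncated = embedding[..., :dim]
    norm = np.linalg.norm(truncated, axis=-1, keepdims=True)
    # Guard against division by zero for an all-zero prefix.
    return truncated / np.clip(norm, 1e-12, None)

# Assumption: random data standing in for the output of an MRL-trained model.
rng = np.random.default_rng(0)
full = rng.standard_normal((4, 768)).astype(np.float32)

# One vector per document yields a whole family of smaller vectors.
nested = {d: truncate_embedding(full, d) for d in (64, 128, 256, 768)}
for d, vecs in nested.items():
    print(d, vecs.shape)
```

Re-normalizing after truncation keeps the dot product equal to cosine similarity at every size.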

Adaptive Retrieval makes vector search more efficient by shortlisting candidates with a low-dimensional representation and then reranking them with a higher-dimensional one. With MRL (Matryoshka Representation Learning), this needs only a single forward pass through the neural network, because the one output vector already contains the representations of every smaller dimension as its prefixes.
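The two-stage idea can be sketched as follows. This is an illustration under stated assumptions, not the repository's implementation: `adaptive_search`, `short_dim`, `shortlist_size` and `top_k` are hypothetical names, and the random vectors stand in for MRL embeddings.

```python
import numpy as np

def normalize(x: np.ndarray) -> np.ndarray:
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

def adaptive_search(query, docs, short_dim=64, shortlist_size=20, top_k=5):
    # Stage 1: cheap scoring on the first `short_dim` dimensions only.
    coarse = normalize(docs[:, :short_dim]) @ normalize(query[:short_dim])
    shortlist = np.argsort(-coarse)[:shortlist_size]
    # Stage 2: rescore only the shortlist with the full-size vectors.
    fine = normalize(docs[shortlist]) @ normalize(query)
    return shortlist[np.argsort(-fine)[:top_k]]

# Assumption: random data standing in for MRL embeddings.
rng = np.random.default_rng(0)
docs = rng.standard_normal((1000, 256)).astype(np.float32)
# The query is a slightly perturbed copy of document 42.
query = docs[42] + 0.05 * rng.standard_normal(256).astype(np.float32)

result = adaptive_search(query, docs)
print(result)
```

The expensive full-dimension comparison runs on `shortlist_size` documents instead of the whole collection.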

BINARY EMBEDDING - a quantization technique that reduces the precision of each component of a vector to 1 bit, i.e. b = sign(x), where

sign(x) = 1 if x >= 0, and 0 if x < 0

example: x = 0.3

sign(x) = 1
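Applied to a whole vector, the rule above can be sketched with NumPy. `binarize` is a hypothetical helper name. Packing 8 bits per byte with `np.packbits` is one common storage layout and an assumption here, not something the text above prescribes.

```python
import numpy as np

def binarize(embeddings: np.ndarray) -> np.ndarray:
    """Map each component to 1 if x >= 0 else 0, then pack 8 bits per byte."""
    bits = (embeddings >= 0).astype(np.uint8)
    return np.packbits(bits, axis=-1)

x = np.array([[0.3, -1.2, 0.0, 0.7, -0.1, 2.5, -3.0, 0.4]], dtype=np.float32)
packed = binarize(x)  # bit pattern: 1 0 1 1 0 1 0 1

print(packed, x.nbytes, packed.nbytes)
```

Eight fp32 components (32 bytes) shrink to a single byte, a 32x reduction in storage.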

RERANKING WITH BINARY EMBEDDINGS - a reranking step in which the candidate documents retrieved with binary embeddings are rescored and reordered using the full-precision (float) query:

     Inner_Product(float_query,binary_doc)

Simply put, the dot product of the fp32 query vector and the binary vector of a candidate document measures the similarity between the query (float vector) and the candidate document (binary vector). Because the query keeps its full precision, this score is finer-grained than a comparison of two binary vectors, and it is used to rerank the candidate documents.
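A sketch of retrieval followed by reranking, assuming the 0/1 encoding defined above and bit-packed document vectors. The helper names (`binarize`, `hamming_candidates`, `rerank`) are hypothetical, the first-stage Hamming-distance search is one common choice rather than something stated in this README, and the random vectors stand in for real embeddings.

```python
import numpy as np

def binarize(x: np.ndarray) -> np.ndarray:
    return np.packbits((x >= 0).astype(np.uint8), axis=-1)

def hamming_candidates(packed_query, packed_docs, k):
    # XOR marks differing bits; counting them gives the Hamming distance.
    diff = np.bitwise_xor(packed_docs, packed_query)
    dist = np.unpackbits(diff, axis=-1).sum(axis=-1)
    return np.argsort(dist)[:k]

def rerank(float_query, packed_docs, candidate_ids):
    # Inner_Product(float_query, binary_doc) for each candidate.
    bits = np.unpackbits(packed_docs[candidate_ids], axis=-1).astype(np.float32)
    scores = bits @ float_query
    order = np.argsort(-scores)
    return candidate_ids[order], scores[order]

# Assumption: random data standing in for real embeddings.
rng = np.random.default_rng(0)
docs = rng.standard_normal((50, 64)).astype(np.float32)
packed_docs = binarize(docs)  # only this packed form needs to be stored
query = docs[7] + 0.01 * rng.standard_normal(64).astype(np.float32)

candidates = hamming_candidates(binarize(query), packed_docs, k=10)
ids, scores = rerank(query, packed_docs, candidates)
print(ids[:3], scores[:3])
```

Only the query is kept in fp32, so the document index stays at 1 bit per dimension.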

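Both techniques, MRL and binary embeddings, are independent of each other and can be combined easily: MRL reduces the number of dimensions and binary quantization reduces each remaining dimension to 1 bit, which compounds the storage savings and speeds up vector processing.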
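To make the combined saving concrete, here is a small sketch that truncates and then binarizes. The sizes (1024 dimensions truncated to 256) are illustrative assumptions, `mrl_binary` is a hypothetical helper name, and the random matrix stands in for MRL embeddings.

```python
import numpy as np

def mrl_binary(embeddings: np.ndarray, dim: int) -> np.ndarray:
    """Truncate to the first `dim` dimensions, then quantize to packed bits."""
    truncated = embeddings[:, :dim]
    # No re-normalization is needed: scaling by a positive norm keeps the sign.
    return np.packbits((truncated >= 0).astype(np.uint8), axis=-1)

# Assumption: random data standing in for 1024-dimensional MRL embeddings.
rng = np.random.default_rng(0)
full = rng.standard_normal((1000, 1024)).astype(np.float32)

compact = mrl_binary(full, dim=256)
ratio = full.nbytes / compact.nbytes
print(full.nbytes, compact.nbytes, ratio)
```

In this setup each document drops from 4096 bytes to 32 bytes: a 4x reduction from truncation multiplied by a 32x reduction from binarization.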

References

MRL Paper: Matryoshka Representation Learning (MRL)
MRL Blog: Matryoshka Representation Learning (MRL) Blog
Binary Embeddings Blog: Embedding Quantization Blog

nis12ram/mrl_and_binary_embeddings
