Commit

Merge pull request #224 from superlinked/robertdhayanturner-patch-3
Update retrieval_from_image_and_text.md
robertdhayanturner authored Feb 13, 2024
2 parents b1c8d3f + 4d7cb74 commit 36935df
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/use_cases/retrieval_from_image_and_text.md
@@ -93,7 +93,7 @@ In experiment 4, we look at the performance of models based on [Contrastive Lang

![](assets/use_cases/retrieval_from_image_and_text/clip.png)

-_CLIP's high level architecture (above), from_ [_Learning Transferable Visual Models From Natural Language Supervision_](https://arxiv.org/pdf/2103.00020.pdf)
+_CLIP's high level architecture (above), from_ [_Learning Transferable Visual Models From Natural Language Supervision_](https://arxiv.org/pdf/2103.00020.pdf)_._

The structure of CLIP encoders (image above) makes them versatile and adaptable to various model architectures for embedding text or image data. In our experiment, we used pretrained models from the [OpenClip leaderboard](https://github.com/mlfoundations/open_clip/blob/main/docs/openclip_results.csv), and applied the Image Encoder to embed the images. Then we evaluated the outcomes.
