A repo for cool image-text retrieval using OpenAI's CLIP model. For information about how CLIP works, see my article on Medium.
This repo uses the Flickr 8k dataset from Kaggle, which contains a variety of images, each paired with 5 different captions. A few example images and captions can be found in the `sample_data` folder.
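To make the retrieval idea concrete, here is a minimal sketch of scoring an image against candidate captions with CLIP. It uses the Hugging Face `transformers` implementation and a hypothetical image path purely for illustration; the repo's own loading and inference code may differ.

```python
# Minimal sketch of CLIP image-text scoring (illustration only).
from PIL import Image
import torch
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("sample_data/example.jpg")  # hypothetical sample image path
captions = ["a dog running on grass", "a man riding a bicycle"]

inputs = processor(text=captions, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

# logits_per_image holds the similarity of the image to each caption.
probs = outputs.logits_per_image.softmax(dim=-1)
print(dict(zip(captions, probs[0].tolist())))
```

Retrieval is the same computation at scale: embed all the images once, then rank them by similarity to a query embedding.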
- Install all the dependencies with Poetry using `poetry install`. It is recommended to configure Poetry to use the currently activated Python version with the `poetry config virtualenvs.prefer-active-python true` command.
- The first step is to pre-compute the image embeddings with the `image_text_retrieval/scripts/pre_compute_embeddings.py` script, which can be run with the `pre_compute_embeddings` command. It uses DVC to store the script parameters, which can be changed in the `params.yaml` file. It expects a directory of images (`data/images` by default) and saves out the embeddings and a mapping file as two separate files. A rough sketch of this step is shown after this list.
- To run the backend API, use the `image_text_retrieval_api` command and go to http://0.0.0.0:8000/docs in the browser to get an interactive Swagger UI (an example request is sketched after this list):
- To run the Streamlit UI, run `streamlit run image_text_retrieval/ui/app_ui.py` and it should open in the browser. You should see a page which looks like this:
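As referenced in the pre-compute step above, here is a hedged sketch of what pre-computing and saving the image embeddings could look like. The model choice, output file names, and formats are assumptions; the actual logic lives in `image_text_retrieval/scripts/pre_compute_embeddings.py` and is configured through `params.yaml`.

```python
# Hedged sketch of pre-computing image embeddings; file names and
# formats are assumptions, not the script's real outputs.
import json
from pathlib import Path

import numpy as np
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32").eval()
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image_dir = Path("data/images")  # default image directory
image_paths = sorted(image_dir.glob("*.jpg"))

embeddings = []
with torch.no_grad():
    for path in image_paths:
        inputs = processor(images=Image.open(path), return_tensors="pt")
        features = model.get_image_features(**inputs)
        embeddings.append(features.squeeze(0).numpy())

# One file for the embedding matrix, one for the index -> filename
# mapping (hypothetical output names).
np.save("embeddings.npy", np.stack(embeddings))
Path("mapping.json").write_text(
    json.dumps({i: p.name for i, p in enumerate(image_paths)})
)
```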
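For the backend API step, a text-to-image query might look like the following. The endpoint path and payload shape are assumptions, so check the Swagger page at http://0.0.0.0:8000/docs for the real routes.

```python
# Hedged sketch of calling the running API; the route and JSON body
# are hypothetical.
import requests

response = requests.post(
    "http://0.0.0.0:8000/text-to-image",  # hypothetical endpoint
    json={"text": "a dog running on grass"},
)
response.raise_for_status()
print(response.json())
```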