CLIP

An example of OpenAI's CLIP in MLX. The CLIP (contrastive language-image pre-training) model embeds images and text in the same space.¹

Setup

Install the dependencies:

pip install -r requirements.txt

Next, download a CLIP model from Hugging Face and convert it to MLX. The default model is openai/clip-vit-base-patch32.

python convert.py

The script will by default download the model and configuration files to the directory mlx_model/.

Run

You can use the CLIP model to embed images and text.

from PIL import Image
import clip

model, tokenizer, img_processor = clip.load("mlx_model")
inputs = {
    "input_ids": tokenizer(["a photo of a cat", "a photo of a dog"]),
    "pixel_values": img_processor(
        [Image.open("assets/cat.jpeg"), Image.open("assets/dog.jpeg")]
    ),
}
output = model(**inputs)

# Get text and image embeddings:
text_embeds = output.text_embeds
image_embeds = output.image_embeds

Run the above example with python clip.py.

To embed only images or only the text, pass only the input_ids or pixel_values, respectively.

This example re-implements minimal image preprocessing and tokenization to reduce dependencies. For additional preprocessing functionality, you can use transformers. The file hf_preproc.py has an example.

MLX CLIP has been tested and works with the following Hugging Face repos:

You can run the tests with:

python test.py

To test new models, update the MLX_PATH and HF_PATH in test.py.

Attribution

assets/cat.jpeg is a "Cat" by London's, licensed under CC BY-SA 2.0.
assets/dog.jpeg is a "Happy Dog" by tedmurphy, licensed under CC BY 2.0.

Refer to the original paper Learning Transferable Visual Models From Natural Language Supervision or blog post ↩

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
images		images
store.db		store.db
templates		templates
.gitignore		.gitignore
README.md		README.md
clip.py		clip.py
convert.py		convert.py
hf_preproc.py		hf_preproc.py
image_processor.py		image_processor.py
model.py		model.py
requirements.txt		requirements.txt
server.py		server.py
test.py		test.py
tokenizer.py		tokenizer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CLIP

Setup

Run

Attribution

About

Releases

Packages

Languages

laokiea/mlx-clip

Folders and files

Latest commit

History

Repository files navigation

CLIP

Setup

Run

Attribution

Footnotes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages