feat: add support for image embeddings #1753

collindutter · 2025-02-20T00:13:18Z

I have read and agree to the contributing guidelines.

Describe your changes

Sorry for big-ish PR, lots of overlapping changes here:

Add support for embedding images in Embeddings Drivers. Also refactors Embedding Drivers to have a single embed method rather than type-specific methods (embed_string, embed_image, etc).
a. Add support for embedding images in Amazon Bedrock Titan Embedding Driver.
b. Add support for embedding images in VoyageAi Embedding Driver.
Add support for querying/upserting with Vector Store Drivers. Vector Store Driver must be configured with a multi-modal Embedding Driver. Also refactors Vector Store Drivers to add single upsert method rather than type-specific methods (upsert_text, upsert_text_artifact, etc).
a. Add support to Local Vector Store Driver to save Entrys with Image Artifacts into the persist_file.

Issue ticket number and link

Closes #1729

codecov · 2025-02-20T00:31:15Z

Codecov Report

Attention: Patch coverage is 96.15385% with 3 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
...riptape/drivers/vector/base_vector_store_driver.py	91.30%	0 Missing and 2 partials ⚠️
...riptape/drivers/embedding/base_embedding_driver.py	92.85%	0 Missing and 1 partial ⚠️

📢 Thoughts on this report? Let us know!

…beddings

…dings

emjay07 · 2025-02-28T23:58:48Z

docs/griptape-framework/drivers/embedding-drivers.md

- [embed_string()](../../reference/griptape/drivers/embedding/base_embedding_driver.md#griptape.drivers.embedding.base_embedding_driver.BaseEmbeddingDriver.embed_string) for any string.
-
-You can optionally provide a [Tokenizer](../misc/tokenizers.md) via the [tokenizer](../../reference/griptape/drivers/embedding/base_embedding_driver.md#griptape.drivers.embedding.base_embedding_driver.BaseEmbeddingDriver.tokenizer) field to have the Driver automatically chunk the input text to fit into the token limit.
+Embeddings in Griptape are multidimensional representations of text and image data. Embeddings carry semantic information, which makes them useful for extracting relevant chunks from large bodies of text for search and querying.


would "text or image data" be better as they are embedding separately from each other?

In the last sentence, we should include "images" somehow so:

which makes them useful for extracting relevant information from large bodies of text or image data for search and querying.

emjay07 · 2025-03-01T00:13:42Z

griptape/drivers/vector/base_vector_store_driver.py

@@ -40,18 +36,80 @@ def to_artifact(self) -> BaseArtifact:

    def upsert_text_artifacts(
        self,
-        artifacts: list[TextArtifact] | dict[str, list[TextArtifact]],
+        artifacts: list[TextArtifact] | dict[str, list[TextArtifact] | list[ImageArtifact]],


why do we want to support ImageArtifact in a method called upsert_text_artifacts? if we do keep it, should the first type also be list[TextArtifact] | list[ImageArtifact]

emjay07 · 2025-03-01T00:15:19Z

griptape/drivers/vector/base_vector_store_driver.py

+    @overload
+    def upsert_collection(
+        self,
+        artifacts: list[TextArtifact],


should we support both text and image here?

emjay07 · 2025-03-01T00:15:43Z

griptape/drivers/vector/base_vector_store_driver.py

+
+    def upsert_collection(
+        self,
+        artifacts: list[TextArtifact] | dict[str, list[TextArtifact] | list[ImageArtifact]],


same commont on first type, or list[ImageArtifact]

emjay07 · 2025-03-01T00:20:43Z

griptape/drivers/vector/base_vector_store_driver.py

        self,
-        artifact: TextArtifact,
+        value: str | TextArtifact | ImageArtifact,


since we are allowing users to input str types, should we also allow bytes if a user has a loaded image already in memory they can upsert that may not be wrapped as an ImageArtifact?

emjay07 · 2025-03-01T00:21:36Z

griptape/drivers/vector/griptape_cloud_vector_store_driver.py

@@ -97,6 +98,8 @@ def query(

        Performs a query on the Knowledge Base and returns Artifacts with close vector proximity to the query, optionally filtering to only those that match the provided filter(s).
        """
+        if isinstance(query, BaseArtifact):


why allow the type, just to raise an error later? what if someone tries to pass a TextArtifact? should the type be ImageArtifact instead?

collindutter force-pushed the feature/image-embeddings-vector branch from 08ccde7 to 79e7dcf Compare February 20, 2025 00:28

collindutter force-pushed the feature/image-embeddings-vector branch 6 times, most recently from 4459f3d to 159a25c Compare February 24, 2025 23:02

collindutter self-assigned this Feb 25, 2025

collindutter force-pushed the feature/image-embeddings-vector branch 6 times, most recently from 7d925d3 to bc85088 Compare February 28, 2025 22:36

collindutter changed the title ~~Feature/image embeddings vector~~ feat: add support for image embeddings Feb 28, 2025

collindutter added 5 commits February 28, 2025 14:45

feat(drivers-embedding): add support for embedding ImageArtifacts

69963aa

feat(drivers-embedding-voyageai): add support for generating image em…

21ccfc3

…beddings

feat(drivers-embedding-titan): add support for generating image embed…

65c4e4e

…dings

feat(drivers-local-vector): support persisting multi-modal entries

39ceb3a

feat(drivers-vector): support querying with ImageArtifacts

bc5d697

collindutter force-pushed the feature/image-embeddings-vector branch from bc85088 to bc5d697 Compare February 28, 2025 22:46

collindutter requested review from emjay07 and a team February 28, 2025 22:51

collindutter marked this pull request as ready for review February 28, 2025 22:51

emjay07 reviewed Mar 1, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add support for image embeddings #1753

feat: add support for image embeddings #1753

collindutter commented Feb 20, 2025 •

edited

Loading

codecov bot commented Feb 20, 2025 •

edited

Loading

emjay07 Feb 28, 2025

emjay07 Mar 1, 2025

emjay07 Mar 1, 2025

emjay07 Mar 1, 2025

emjay07 Mar 1, 2025

emjay07 Mar 1, 2025

feat: add support for image embeddings #1753

Are you sure you want to change the base?

feat: add support for image embeddings #1753

Conversation

collindutter commented Feb 20, 2025 • edited Loading

Describe your changes

Issue ticket number and link

codecov bot commented Feb 20, 2025 • edited Loading

Codecov Report

emjay07 Feb 28, 2025

Choose a reason for hiding this comment

emjay07 Mar 1, 2025

Choose a reason for hiding this comment

emjay07 Mar 1, 2025

Choose a reason for hiding this comment

emjay07 Mar 1, 2025

Choose a reason for hiding this comment

emjay07 Mar 1, 2025

Choose a reason for hiding this comment

emjay07 Mar 1, 2025

Choose a reason for hiding this comment

collindutter commented Feb 20, 2025 •

edited

Loading

codecov bot commented Feb 20, 2025 •

edited

Loading