Fixes for Gemini inside Vertex #11511

tbeamish-benchsci · 2024-02-29T17:55:54Z

Description

Gemini requires an even message count and, in the Gemini LLM, we merge_neighboring_same_role_messages. This PR does the same thing in the Vertex LLM if we're using Gemini.
We also set the default system_role in the LLMMetadata to be System.USER only if we're using Gemini, since Gemini only supports "user" and "model" roles.
Finally we ensure the role of each message in Vertex is set to the appropriate MessageRole

NOTE: This change adds/updates NO UNIT TESTs to support it.

Fixes # (#11439)

Type of Change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration

I stared at the code and made sure it makes sense

Suggested Checklist:

I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas

llama-index-integrations/llms/llama-index-llms-vertex/llama_index/llms/vertex/base.py

llama-index-integrations/llms/llama-index-llms-gemini/llama_index/llms/gemini/utils.py

logan-markewich

this looks good to me, thanks @tbeamish-benchsci !

* main: (61 commits) bump anthropic versions (run-llama#11654) Fixes for Gemini inside Vertex (run-llama#11511) Update multimodal anthropic docs (run-llama#11643) fix anthropic tokenizer (run-llama#11636) extend claude 3 multimodal cookbook (run-llama#11639) fix pydantic (run-llama#11631) Added Nest Asyncio to Opensearch Client Init to Support ASGI applicat… (run-llama#11622) update readmes (run-llama#11627) typo in chat refine prompt (run-llama#11628) Add Anthropic Multi Modal (run-llama#11623) Add Vertex AI Text Embedding and Multimodal Embedding (run-llama#11561) Update anthropic models (run-llama#11612) Bumped Opensearch Integration Version (run-llama#11618) Solve issue of duplicate Settings classes (run-llama#11606) fix composable retrieval (run-llama#11617) Updated AzureCosmosDBMongoDBVectorSearch to Pydantic Vector Store base class (run-llama#11613) Updated model information for Perplexity.ai (run-llama#11603) Solve issue in custom_agent document , RetryAgentWorker._run_step() got an unexpected keyword argument 'input' (run-llama#11611) Fix empty metadata error for CSV Reader (run-llama#11563) Feature: Improve batch embedding generation throughput for Cohere in Bedrock (run-llama#11572) ...

* main: bump anthropic versions (run-llama#11654) Fixes for Gemini inside Vertex (run-llama#11511) Update multimodal anthropic docs (run-llama#11643) fix anthropic tokenizer (run-llama#11636) extend claude 3 multimodal cookbook (run-llama#11639) fix pydantic (run-llama#11631) Added Nest Asyncio to Opensearch Client Init to Support ASGI applicat… (run-llama#11622) update readmes (run-llama#11627) typo in chat refine prompt (run-llama#11628) Add Anthropic Multi Modal (run-llama#11623) Add Vertex AI Text Embedding and Multimodal Embedding (run-llama#11561) Update anthropic models (run-llama#11612) Bumped Opensearch Integration Version (run-llama#11618) Solve issue of duplicate Settings classes (run-llama#11606) fix composable retrieval (run-llama#11617) Updated AzureCosmosDBMongoDBVectorSearch to Pydantic Vector Store base class (run-llama#11613) Updated model information for Perplexity.ai (run-llama#11603) Solve issue in custom_agent document , RetryAgentWorker._run_step() got an unexpected keyword argument 'input' (run-llama#11611) Fix empty metadata error for CSV Reader (run-llama#11563)

tbeamish-benchsci force-pushed the fixes-for-gemini-in-vertex branch from a6f9b27 to 563e4a6 Compare February 29, 2024 17:57

tbeamish-benchsci commented Feb 29, 2024

View reviewed changes

llama-index-integrations/llms/llama-index-llms-vertex/llama_index/llms/vertex/base.py Outdated Show resolved Hide resolved

tbeamish-benchsci force-pushed the fixes-for-gemini-in-vertex branch from e425f90 to 3d2bc58 Compare February 29, 2024 23:14

tbeamish-benchsci commented Feb 29, 2024

View reviewed changes

llama-index-integrations/llms/llama-index-llms-gemini/llama_index/llms/gemini/utils.py Outdated Show resolved Hide resolved

tbeamish-benchsci force-pushed the fixes-for-gemini-in-vertex branch 2 times, most recently from 295872c to 2fd9b0e Compare March 4, 2024 22:17

Set MessageRole appropriately for Gemini via Vertex

7ee8e4d

tbeamish-benchsci force-pushed the fixes-for-gemini-in-vertex branch from 2fd9b0e to 7ee8e4d Compare March 4, 2024 22:24

tbeamish-benchsci marked this pull request as ready for review March 4, 2024 22:42

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Mar 4, 2024

logan-markewich approved these changes Mar 4, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Mar 4, 2024

bump

49e8593

logan-markewich merged commit c5bed00 into run-llama:main Mar 5, 2024
8 checks passed

tbeamish-benchsci deleted the fixes-for-gemini-in-vertex branch March 5, 2024 17:01

Izukimat pushed a commit to Izukimat/llama_index that referenced this pull request Mar 29, 2024

Fixes for Gemini inside Vertex (run-llama#11511)

afc6f76

This was referenced Apr 5, 2024

[Bug]: Not able to use Gemini. #12100

Closed

Revert "bump google-generativeai to 0.4 (#12085)" #12126

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes for Gemini inside Vertex #11511

Fixes for Gemini inside Vertex #11511

tbeamish-benchsci commented Feb 29, 2024 •

edited

Loading

logan-markewich left a comment

Fixes for Gemini inside Vertex #11511

Fixes for Gemini inside Vertex #11511

Conversation

tbeamish-benchsci commented Feb 29, 2024 • edited Loading

Description

Type of Change

How Has This Been Tested?

Suggested Checklist:

logan-markewich left a comment

Choose a reason for hiding this comment

tbeamish-benchsci commented Feb 29, 2024 •

edited

Loading