Add support for Gemini Structured Outputs #472

willbakst · 2024-09-05T19:03:42Z

Description

Recently Google released Structured Outputs for Gemini (much like OpenAI).

Should be fairly simple to enable this by updating setup_call for gemini and updating the warning to include gemini as a module that supports strict in the response model schema (although it seems like structured outputs for tools is not quite supported).

See: https://ai.google.dev/gemini-api/docs/json-mode?lang=python#supply-schema-in-config

Only difficulty I can see right now is that the only two supported inputs are either raw type annotations (so no BaseModel) or their genai.protos.Schema type (which will require conversion from BaseModel). Should have most of this implemented already I believe, so still shouldn't be terribly difficult.

The text was updated successfully, but these errors were encountered:

AbhishekRP2002 · 2024-11-04T03:49:57Z

hi @willbakst if this issue's still open, I would love to take this up!

willbakst · 2024-11-04T03:52:37Z

@AbhishekRP2002 it's open! would love for you to take this

if you have any questions let me know! our implementation of OpenAI structured outputs should act as a good reference

AbhishekRP2002 · 2024-11-04T16:33:43Z

sure, will let you know if there's any blocker in any way.

AbhishekRP2002 · 2024-12-07T21:41:34Z

hi @willbakst apologies for no updates since past 1 month, been piled with lots of eng work, so couldn't give this much time.
i was going through the codebase last night and noticed some form of implementation for structured_output in gemini, however i don't see the response_schema param in the config:

i also took a look at the openai implementation,
here is what i am thinking:
adding something like

if response_schema:
        # Convert response_schema to compatible format
        converted_schema = _convert_schema(response_schema)
        
        call_kwargs['generation_config'] = {
            'response_mime_type': 'application/json',
            'response_schema': converted_schema

i also noticed we can extract structured outputs via function calling in gemini which seemed more close to the openai implementation under _setup_call.py

found this example also: https://github.com/Mirascope/mirascope/blob/f4474e505a7bd67741faa44065fd7ba772681c50/examples/learn/response_models/json_mode/gemini/messages.py

seems like already implemented ?

would be really grateful if u can share your thoughts or help me out here in any way

willbakst · 2024-12-07T22:51:04Z

Ah I can see the confusion here. Hopefully this explanation will clear things up.

There are three types of structured outputs:

Tools (which we attempt to force to be called and then extract / convert into the originally provided model)
JSON mode (same thing as tools except we use JSON to power the structure)
Provider-specific structured outputs (i.e. OpenAI).

We've implemented (1) and (2) across all currently supported providers, but we've only implemented (3) for OpenAI (see here), but Gemini now also supports strict structured outputs.

The idea with this issue would be to add the same support for Gemini that we have for OpenAI (by setting model_config = ResponseModelConfigDict(strict=True)).

willbakst added the enhancement New feature or request label Sep 5, 2024

willbakst added the good first issue Good for newcomers label Sep 20, 2024

willbakst added the hacktoberfest Hacktoberfest! label Oct 16, 2024

willbakst removed the hacktoberfest Hacktoberfest! label Nov 4, 2024

willbakst assigned AbhishekRP2002 Nov 5, 2024

willbakst added the Feature Request label Dec 19, 2024 — with Linear

willbakst removed the Feature Request label Dec 19, 2024

willbakst added the mirascope label Dec 19, 2024 — with Linear

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Gemini Structured Outputs #472

Add support for Gemini Structured Outputs #472

willbakst commented Sep 5, 2024

AbhishekRP2002 commented Nov 4, 2024

willbakst commented Nov 4, 2024

AbhishekRP2002 commented Nov 4, 2024

AbhishekRP2002 commented Dec 7, 2024 •

edited

Loading

willbakst commented Dec 7, 2024

Add support for Gemini Structured Outputs #472

Add support for Gemini Structured Outputs #472

Comments

willbakst commented Sep 5, 2024

Description

AbhishekRP2002 commented Nov 4, 2024

willbakst commented Nov 4, 2024

AbhishekRP2002 commented Nov 4, 2024

AbhishekRP2002 commented Dec 7, 2024 • edited Loading

willbakst commented Dec 7, 2024

AbhishekRP2002 commented Dec 7, 2024 •

edited

Loading