Gemini 1.5 support #349

naman1608 · 2024-05-30T06:47:51Z

No description provided.


naman1608 · 2024-05-30T06:49:03Z

@abi Even though it runs, the code the model generates doesn't match the uploaded image. I'm not sure whether the prompt being sent is wrong or it's a model issue. How can I check that?

abi · 2024-05-30T11:37:00Z

I have a helper function pprompt that lets you print the prompt nicely. My guess is that the image isn’t being sent correctly and that’s why it’s producing a random web page. I’ll take a look at your code to debug.
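For readers without the repo at hand, here is a minimal sketch of what a prompt-printing helper along these lines might look like. The name `summarize_prompt` and its `max_len` parameter are hypothetical and are not the repo's `pprompt`; the sketch assumes OpenAI-style messages and only shortens long image URLs so the structure of the prompt is readable.

```python
import copy
import json


def summarize_prompt(messages, max_len=40):
    """Return a JSON string of the messages with long image URLs shortened.

    Hypothetical helper for illustration; not the repo's pprompt.
    """
    # Work on a copy so the messages that get sent to the model stay untouched.
    trimmed = copy.deepcopy(messages)
    for message in trimmed:
        content = message.get("content")
        if not isinstance(content, list):
            continue  # Plain string content needs no trimming.
        for part in content:
            if part.get("type") == "image_url":
                url = part["image_url"]["url"]
                if len(url) > max_len:
                    part["image_url"]["url"] = f"{url[:max_len]}... ({len(url)} chars)"
    return json.dumps(trimmed, indent=2)
```

Printing the result before the request goes out shows whether an `image_url` part is present at all and roughly how large it is.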

naman1608 · 2024-05-30T13:58:55Z

I used that, but I could only check the prompt in the OpenAI format, since the pprint function takes input in that format. I guess the issue is in the conversion to the format Gemini expects.
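One way to check the Gemini side is to summarize the converted list just before it is sent. The sketch below is an assumption about how that could be done: `describe_parts` is a hypothetical name, and it assumes the converted input is a flat list of strings and image objects (for example PIL images, which expose `size` and `format`).

```python
def describe_parts(parts, preview=60):
    """Return one summary line per item in a Gemini-style content list.

    Hypothetical debugging helper; assumes items are strings or image objects.
    """
    lines = []
    for index, part in enumerate(parts):
        if isinstance(part, str):
            text = part if len(part) <= preview else part[:preview] + "..."
            lines.append(f"[{index}] text ({len(part)} chars): {text!r}")
        else:
            # PIL images expose .size and .format; other objects fall back to None.
            size = getattr(part, "size", None)
            fmt = getattr(part, "format", None)
            lines.append(f"[{index}] {type(part).__name__} size={size} format={fmt}")
    return lines
```

If no image line shows up in the output, the image was dropped during conversion rather than ignored by the model.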

msamylea · 2024-07-01T15:02:38Z

I rewrote the Gemini portion of this and it's working for me now. The issue was that Gemini expects the image file to be uploaded and then referenced.

This is working for me:

import base64
import io
from typing import Awaitable, Callable, List

import google.generativeai as genai
from openai.types.chat import ChatCompletionMessageParam
from PIL import Image


async def stream_gemini_response(
    messages: List[ChatCompletionMessageParam],
    api_key: str,
    callback: Callable[[str], Awaitable[None]],
) -> str:
    # Configure the client before creating the model.
    genai.configure(api_key=api_key)
    model = genai.GenerativeModel("gemini-1.5-flash-latest")

    # Flatten the OpenAI-style messages into a list of text strings and PIL images.
    gemini_messages = []
    for message in messages:
        if isinstance(message["content"], str):
            gemini_messages.append(message["content"])
        elif isinstance(message["content"], list):
            for content in message["content"]:
                if content["type"] == "text":
                    gemini_messages.append(content["text"])
                elif content["type"] == "image_url":
                    # The URL is a base64 data URL; decode the part after the comma.
                    image_url = content["image_url"]["url"]
                    image_data = base64.b64decode(image_url.split(",")[1])
                    image = Image.open(io.BytesIO(image_data))
                    gemini_messages.append(image)

    try:
        response = model.generate_content(gemini_messages, stream=True)
        # Accumulate chunks while streaming so the full text can be returned.
        full_response = ""
        for chunk in response:
            if chunk.text:
                full_response += chunk.text
                await callback(chunk.text)
        return full_response
    except Exception as e:
        print(f"An error occurred: {str(e)}")
        return ""
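The decoding step in the function above takes whatever follows the first comma in the URL, which assumes the image always arrives as a base64 data URL. A slightly stricter parse could look like the sketch below, so that a plain `https://` URL or a malformed payload fails loudly instead of producing a broken image. `parse_data_url` is a hypothetical helper and is not part of the PR.

```python
import base64


def parse_data_url(url):
    """Split a base64 data URL into (mime_type, raw_bytes).

    Hypothetical helper for illustration; raises ValueError on anything else.
    """
    if not url.startswith("data:"):
        raise ValueError("expected a data: URL")
    header, sep, payload = url[len("data:"):].partition(",")
    if not sep:
        raise ValueError("data URL has no comma between header and payload")
    if not header.endswith(";base64"):
        raise ValueError("only base64-encoded data URLs are handled here")
    # The media type is the first ';'-separated field of the header.
    mime_type = header.split(";")[0]
    # validate=True rejects characters outside the base64 alphabet.
    return mime_type, base64.b64decode(payload, validate=True)
```

The returned bytes can then be wrapped in `io.BytesIO` and opened with `Image.open`, as the function above already does.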

naman1608 and others added 5 commits May 28, 2024 23:56

add gemini 1.5 pro to lib models

08fb318

gemini model support added

f751fff

Merge branch 'abi:main' into gemini-1.5-support

4c17023

print statements removed

22f0b13

Merge branch 'gemini-1.5-support' of https://github.com/naman1608/scr…

2b86b19

…eenshot-to-code into gemini-1.5-support
