[Bug]: Different Behavior with Image Input on GROQ/Llama 3.2 Vision Model vs Qwen #6912

NEWbie0709 · 2024-11-26T08:26:45Z

What happened?

I've encountered an issue while using LiteLLM with the GROQ/Llama 3.2 Vision model and Qwen. The problem arises specifically when providing an image input.

--GROQ/Llama 3.2 Vision Model: Works as expected with image inputs.
--Qwen-vl-max-latest: Produces an error when processing the same image input.

code using

import requests
import json

url = 'http://localhost:4000/chat/completions'

headers = {
    'Content-Type': 'application/json',
    'Authorization': 'Bearer sk-1234'
}

data = {
    "model": "model",  # your model_name
    "messages": [
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "What's in this image?"
                },
                {
                    "type": "image_url",
                    "image_url": {
                        "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
                    }
                }
            ]
        }
    ]
}

print(json.dumps(data))
response = requests.post(url, headers=headers, data=json.dumps(data))

print(response.status_code)
print(response.json())

Output from groq

Output from qwen

Relevant log output

No response

Twitter / LinkedIn details

No response

The text was updated successfully, but these errors were encountered:

krrishdholakia · 2024-12-07T17:39:35Z

qwen isn't a supported provider. Are they openai compatible?

NEWbie0709 · 2024-12-09T01:29:12Z

Yes, Qwen is OpenAI-compatible. I’m using the OpenAI-compatible method to access it through LiteLLM
https://help.aliyun.com/zh/dashscope/developer-reference/compatibility-of-openai-with-dashscope?spm=a2c4g.11186623.help-menu-610100.d_3_6_0.7575528aKnGgE0

NEWbie0709 added the bug Something isn't working label Nov 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: Different Behavior with Image Input on GROQ/Llama 3.2 Vision Model vs Qwen #6912

[Bug]: Different Behavior with Image Input on GROQ/Llama 3.2 Vision Model vs Qwen #6912

NEWbie0709 commented Nov 26, 2024

krrishdholakia commented Dec 7, 2024

NEWbie0709 commented Dec 9, 2024

[Bug]: Different Behavior with Image Input on GROQ/Llama 3.2 Vision Model vs Qwen #6912

[Bug]: Different Behavior with Image Input on GROQ/Llama 3.2 Vision Model vs Qwen #6912

Comments

NEWbie0709 commented Nov 26, 2024

What happened?

Relevant log output

Twitter / LinkedIn details

krrishdholakia commented Dec 7, 2024

NEWbie0709 commented Dec 9, 2024