Has anyone looked into getting structured output from a model hosted using vLLM? I'd appreciate some pointers on implementing this if anyone has suggestions.

Replies: 3 comments 1 reply
-
See: https://github.com/vllm-project/vllm/blob/v0.6.0/vllm/engine/arg_utils.py#L276
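Besides `guided_json` (shown in the next comment), vLLM's OpenAI-compatible server also accepts related guided decoding fields such as `guided_choice` and `guided_regex` in `extra_body`. A minimal sketch using `guided_choice`, assuming a local server; the URL, model name, and API key are placeholders:

```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="token")

completion = client.chat.completions.create(
    model="yourmodelhere",
    messages=[
        {"role": "user", "content": "Is this review positive or negative: 'Great movie!'"}
    ],
    extra_body={
        # Restrict generation to exactly one of the listed strings.
        "guided_choice": ["positive", "negative"],
    },
)
print(completion.choices[0].message.content)  # "positive" or "negative"
```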
-
Hi @AkshataDM, you can pass a JSON schema through `extra_body` using `guided_json`:

```python
import json

from openai import OpenAI
from pydantic import BaseModel


# Schema the model's output must conform to.
class AnswerFormat(BaseModel):
    name: str
    lastname: str
    age: int


client = OpenAI(
    base_url="http://...../v1",  # your vLLM OpenAI-compatible endpoint
    api_key="token",
)

user_prompt = "......"

completion = client.chat.completions.create(
    model="yourmodelhere",
    messages=[{"role": "user", "content": user_prompt}],
    extra_body={
        # vLLM-specific field: constrain generation to this JSON schema.
        "guided_json": AnswerFormat.model_json_schema(),
    },
)

result = json.loads(completion.choices[0].message.content)
```
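If you want the parsed result as a typed object rather than a plain dict, the same Pydantic model can validate the raw response; a small sketch using Pydantic v2's `model_validate_json`, reusing `completion` from the snippet above:

```python
# Parse and validate the JSON response into the Pydantic model in one step.
answer = AnswerFormat.model_validate_json(completion.choices[0].message.content)
print(answer.name, answer.lastname, answer.age)
```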