[Usage]: how do I pass in the JSON content-type for Mistral 7B using offline inference? #7030

Closed
RoopeHakulinen opened this issue Aug 1, 2024 · 1 comment · Fixed by #6878
Labels
usage How to use vllm


@RoopeHakulinen

Your current environment

The output of `python collect_env.py`

How would you like to use vllm

I would like to use the JSON mode for Mistral 7B while doing offline inference using the generate method as below. Is that possible somehow? Just using the prompt doesn't seem to produce JSON output as requested. If this is not possible, is the only solution to use something like Outlines? Would love some details on that, if so.

from vllm import LLM, SamplingParams

llm = LLM(model="mistralai/Mistral-7B-v0.3")
prompt = "Please name the biggest and smallest continent in JSON using the following schema: {biggest: <the biggest continent's name>, smallest: <the smallest continent>}"
sampling_params = SamplingParams(temperature=0.8, top_p=1.0)  # temperature was previously an undefined variable
response = llm.generate(prompt, sampling_params)  # was self.llm, but there is no enclosing class here
@DarkLight1337 (Member) commented Aug 1, 2024

This will become available to the offline LLM class via #6878.
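
Once that lands, offline usage should look roughly like the sketch below. This is only a sketch, assuming a later vLLM release that exposes GuidedDecodingParams in vllm.sampling_params and a guided_decoding argument on SamplingParams; check the structured-outputs documentation for your installed version, as the exact import path and argument names may differ.

# Hedged sketch: guided JSON decoding with the offline LLM API.
# Assumes a vLLM release where SamplingParams accepts a guided_decoding
# argument and GuidedDecodingParams can be imported from vllm.sampling_params.
from vllm import LLM, SamplingParams
from vllm.sampling_params import GuidedDecodingParams

# JSON schema constraining the output to the two requested fields.
json_schema = {
    "type": "object",
    "properties": {
        "biggest": {"type": "string"},
        "smallest": {"type": "string"},
    },
    "required": ["biggest", "smallest"],
}

llm = LLM(model="mistralai/Mistral-7B-v0.3")
sampling_params = SamplingParams(
    temperature=0.0,
    guided_decoding=GuidedDecodingParams(json=json_schema),
)
outputs = llm.generate(
    "Name the biggest and smallest continent as JSON.",
    sampling_params,
)
print(outputs[0].outputs[0].text)  # expected shape: {"biggest": "...", "smallest": "..."}

Until that is released, an external constrained-decoding library such as Outlines (mentioned above) is the usual workaround for offline inference.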
