The Feature
I am using Ollama as a backend for my models.
In Continue.dev I want to use Qwen2.5 1.5B to autocomplete my code.
This works perfectly if I set up the config to talk directly to the Ollama API under http://ollamahostip:11434/api/generate.
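For illustration, this is roughly what my working Continue.dev config.json looks like when talking to Ollama directly (the exact model tag is an assumption on my part, and the host is a placeholder):

```json
{
  "tabAutocompleteModel": {
    "title": "Qwen2.5 1.5B",
    "provider": "ollama",
    "model": "qwen2.5-coder:1.5b",
    "apiBase": "http://ollamahostip:11434"
  }
}
```

With this config, Continue.dev sends its autocomplete requests as POST http://ollamahostip:11434/api/generate.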
I never got it to work talking directly to the LiteLLM API (using the Mistral or OpenAI API format), so I tried the pass-through function, which finally worked. However, I have two PCs running the same model for redundancy, and with a pass-through only one server is utilized.
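This is roughly how my pass-through is set up in the LiteLLM config.yaml, a sketch based on LiteLLM's pass_through_endpoints setting (hostnames are placeholders). Because target is a single fixed host, the second PC never receives any traffic:

```yaml
general_settings:
  pass_through_endpoints:
    - path: "/api/generate"                            # what Continue.dev calls on the proxy
      target: "http://ollamahostip:11434/api/generate" # pinned to ONE Ollama backend
```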
I also use Langfuse for monitoring the requests, and when using the pass-through the API user is not visible. My question: are there any plans to implement /api/generate?
Thank you very much!
Best regards, Robert
Motivation, pitch
I want to route all of my AI API requests through LiteLLM, so it would be great if the /api/generate endpoint could be implemented.
Twitter / LinkedIn details
No response
Hi Krish, thank you for the quick reply.
I was not able to find /api/generate in the LiteLLM Swagger docs (https://litellm-api.up.railway.app/).
Continue.dev tries to contact url:port/api/generate directly when Ollama is selected as the provider (I added the LiteLLM url:4000 as the base URL to handle the requests).
They mention that they do not support the OpenAI API for autocomplete because OpenAI does not support FIM, so only the Ollama and Mistral APIs are supported
(see: https://docs.continue.dev/autocomplete/model-setup).
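For reference, this is the setup I am trying to get working, a sketch with placeholder hostnames: Continue.dev's Ollama provider pointed at the LiteLLM proxy, which would require LiteLLM to answer /api/generate natively (and could then load-balance across both of my Ollama backends):

```json
{
  "tabAutocompleteModel": {
    "title": "Qwen2.5 1.5B via LiteLLM",
    "provider": "ollama",
    "model": "qwen2.5-coder:1.5b",
    "apiBase": "http://litellmhostip:4000"
  }
}
```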