No Ollama provider #40

Open
awaescher opened this issue Jan 24, 2025 · 7 comments
Labels
enhancement (New feature or request), model

Comments

@awaescher

I just downloaded the installer for 0.0.2 on Windows.
In issue #11 I saw that there should be a provider for Ollama but it's missing for me.
Any idea how to overcome this?

Image

@6xdd

6xdd commented Jan 24, 2025

Version 0.0.1 supports Ollama:

Ollama support was removed in version 0.0.2. For the reason, see https://github.com/bytedance/UI-TARS-desktop#%EF%B8%8F-important-announcement-gguf-model-performance

@awaescher
Author

This would be really sad! Thanks for the feedback.
But I doubt that this can be the reason as with 0.0.1 the Ollama provider is missing, too.

Image

@RoversX

RoversX commented Jan 24, 2025

At least let us give it a try 🤣

@DefamationStation

> Version 0.0.1 supports Ollama:
>
> Ollama support was removed in version 0.0.2. For the reason, see https://github.com/bytedance/UI-TARS-desktop#%EF%B8%8F-important-announcement-gguf-model-performance

This is not true. 0.0.1 doesn't have Ollama either.

@timothycarambat

Just use the baseURL (http://localhost:11434/v1) of your Ollama instance, and enter the model's name in Ollama as the model name. For the key, just type random stuff. It's literally just an API endpoint: the backend uses the openai package, so any OpenAI-compatible service will work (Ollama, LM Studio, etc.).
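
As a rough illustration of the suggestion above, here is a minimal sketch of a request to Ollama's OpenAI-compatible chat endpoint using plain `fetch`. It is not taken from the app's code. The model name `my-local-model`, the key value, and the helper names `buildChatRequest` and `chat` are placeholders chosen for this example; it assumes Ollama is running locally on its default port.

```typescript
// Sketch: talk to a local Ollama server through its OpenAI-compatible API.
// Assumptions: Ollama listens on the default port, and MODEL names a model
// that is already available locally (the name below is a placeholder).

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

const BASE_URL = "http://localhost:11434/v1"; // base URL from the comment above
const API_KEY = "anything"; // placeholder; the comment above says any value works
const MODEL = "my-local-model"; // hypothetical; use a name shown by `ollama list`

// Build the HTTP request for the chat completions endpoint without sending it.
function buildChatRequest(
  messages: ChatMessage[],
  baseUrl: string = BASE_URL,
  model: string = MODEL,
  apiKey: string = API_KEY,
): ChatRequest {
  return {
    // Strip trailing slashes so the path is joined with exactly one "/".
    url: `${baseUrl.replace(/\/+$/, "")}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ model, messages }),
    },
  };
}

// Send the request and return the text of the first reply.
async function chat(messages: ChatMessage[]): Promise<string> {
  const { url, init } = buildChatRequest(messages);
  const res = await fetch(url, init);
  if (!res.ok) {
    throw new Error(`HTTP ${res.status}: ${await res.text()}`);
  }
  const data = (await res.json()) as {
    choices: { message: { content: string } }[];
  };
  return data.choices[0].message.content;
}
```

With a local server running, `chat([{ role: "user", content: "Hello" }])` should resolve to the model's reply. In the app itself, the same three values (base URL, key, model name) would go into the provider settings.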

@ycjcl868 added the enhancement (New feature or request) and model labels on Feb 9, 2025
@JjjFangg

Hello, we have observed that the performance of the Ollama + GGUF approach is currently inferior to cloud deployment. At this stage, we recommend using Hugging Face inference endpoints for optimal results. We will consider uploading to Ollama once local deployment achieves performance parity with online inference.

@awaescher
Author

I understand. But why actively disable a feature because of performance concerns? People might have a killer AI machine with actually good performance.

Even more important: For prototyping, it would be amazingly helpful to use Ollama to run 100% locally. Think about building an early prototype in an enterprise scenario where going the full way to use external services and share data with them might need weeks of discussions before even being able to try it out.


7 participants