Replies: 1 comment
-
Ollama runs models via the llama.cpp project, which uses its own GGUF model format. MLX models follow a different architecture, so Ollama can't load them. If you want to use PyOllaMx with MLX models, try the sister project PyOMlx.
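If it helps, here is a minimal sketch of running an MLX-format model directly in Python with the `mlx-lm` package. This is not how PyOMlx works internally, just a quick way to confirm an MLX model runs on your machine; the model name, prompt, and parameters below are illustrative assumptions, so swap in whatever MLX model you actually use.

```python
# Minimal sketch: run an MLX-format model with mlx-lm (pip install mlx-lm).
# The model name is an example from the mlx-community org on Hugging Face;
# substitute any MLX model you prefer.
from mlx_lm import load, generate

# Downloads the model on first use and loads it onto Apple Silicon via MLX
model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.2-4bit")

# Generate a short completion
response = generate(
    model,
    tokenizer,
    prompt="Explain the difference between GGUF and MLX model formats.",
    max_tokens=200,
)
print(response)
```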
-
Hi, I'm a bit of an LLM noob, just starting to explore what is possible. At the moment I'm running a number of models through Ollama, but I was keen to explore Apple-native models and came across your repo.
Would it be reasonable of me to assume that an MLX model would run faster/more efficiently on Apple Silicon than the 'equivalent' Ollama model running on Apple Silicon?
I'm using Msty as a desktop app with models served through Ollama, as it has some cool features, especially for RAG. Do you know if it is possible to serve MLX models through Ollama so they can be used in Msty?