
Feature Request: Add support for Pixtral and other Vision models (llama 3.2 11b/90b etc) #5

Closed
YorkieDev opened this issue Oct 8, 2024 · 17 comments
Labels
fixed-in-next-release The next release of LM Studio fixes this issue

Comments

@YorkieDev

Pixtral works great in mlx-vlm (Blaizzy/mlx-vlm#67); it would be great to see support land in LM Studio.

@youcefs21

youcefs21 commented Oct 11, 2024

The mlx-vlm version in LM Studio is 0.0.13, and Pixtral is supported as of 0.0.15. Don't we just need to upgrade the mlx-vlm version?

@YorkieDev
Author

That would probably work. Seems fairly straightforward to do @yagil @neilmehta24

@Blaizzy

Blaizzy commented Oct 11, 2024

Wait for v0.1.0 (Blaizzy/mlx-vlm#41)

It will be released later today or tomorrow.

@Blaizzy

Blaizzy commented Oct 11, 2024

It has some fixes and many new features.

@julien-blanchon

Awesome, v0.1.0 is now merged!

@mattjcly
Member

Pixtral is now supported thanks to @Blaizzy !

@julien-blanchon

I hope this will get bundled to LM Studio soon

@yagil
Member

yagil commented Oct 17, 2024

> I hope this will get bundled to LM Studio soon

Not there yet, but keep an eye on https://lmstudio.ai/beta-releases

@neilmehta24
Member

As of #22, mlx-engine has Pixtral and Llama 3.2 vision support. We expect to roll this out to LM Studio soon.

@orcinus

orcinus commented Dec 23, 2024

Any updates? Vision support in LM Studio is abysmal in general: the UI and UX feel more like an MVP than a usable product, and recent vision-enabled models like Qwen VL are extremely buggy, bordering on unusable. Combined with being months late to support major vision-enabled LLMs, this makes LM Studio a tough sell. (And yes, I know it's free; I'd gladly pay for it if that meant faster rollout of features and architecture support.)

@YorkieDev
Author

Marking this issue as solved, as 0.3.5 Build 9 has support for Pixtral and Llama 3.2 Vision in the MLX Engine.

https://lmstudio.ai/beta-releases

@orcinus

orcinus commented Dec 23, 2024

Still doesn't work.
I get `Unknown ArrayValue filter: trim` when trying to load MLX Llama 3.2 Vision Instruct.
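An error like this usually means the prompt-template engine doesn't implement Jinja's built-in `trim` filter, which Llama-style chat templates use to strip whitespace from message content. A minimal sketch with the `jinja2` package (the template here is a simplified stand-in, not the actual Llama 3.2 template) showing where the filter appears:

```python
from jinja2 import Environment

# Simplified stand-in for a Llama-style chat template that uses `| trim`.
template_src = (
    "{% for m in messages %}"
    "<|start_header_id|>{{ m.role }}<|end_header_id|>\n"
    "{{ m.content | trim }}<|eot_id|>"
    "{% endfor %}"
)

env = Environment()
prompt = env.from_string(template_src).render(
    messages=[{"role": "user", "content": "  Hello  "}]
)
print(prompt)  # content is rendered with surrounding whitespace stripped
```

An engine whose template parser lacks `trim` will reject the template at load time, which matches the error above.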

@orcinus

orcinus commented Dec 23, 2024

Never mind, I'm dumb. Apparently Llama 3.2 Vision does not have a system role (but regular Llama 3.2 does). Weird.

@orcinus

orcinus commented Dec 23, 2024

Unfortunately, it still doesn't work: it leaks RAM badly.
Deleting messages from the context doesn't reduce RAM usage either; it just keeps ballooning from the moment you start the first inference.
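One way to confirm a report like this is to sample peak process memory across repeated inferences. A minimal sketch using the stdlib `resource` module (`run_inference` is a hypothetical placeholder for an actual MLX engine call):

```python
import resource

def peak_rss() -> int:
    # Peak resident set size of this process.
    # Units: kilobytes on Linux, bytes on macOS.
    return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss

def run_inference() -> None:
    # Hypothetical placeholder; a real check would call the model here.
    pass

baseline = peak_rss()
for i in range(5):
    run_inference()
    print(f"iteration {i}: peak RSS {peak_rss()}")
# A healthy engine plateaus after warm-up; a leak shows peak RSS
# climbing on every iteration even after the context is cleared.
```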

@yagil
Member

yagil commented Dec 23, 2024

@orcinus it's worth opening a separate issue for performance; we'd track it independently. Also cc @Blaizzy

@orcinus

orcinus commented Dec 23, 2024

I'm too slow; someone already opened one: #63
I've added my own case to that one.

@Blaizzy

Blaizzy commented Dec 24, 2024

Thanks!

I'll be working on it starting tomorrow.
