
Sampler variants #64

Open · AsbjornOlling wants to merge 11 commits into main
Conversation

@AsbjornOlling (Contributor) commented Dec 16, 2024:

This MR lets us implement different sampling methods with mutually exclusive configuration options, without showing irrelevant options in the UI.

It's currently very boilerplate-y. I'm tempted to try writing a proc macro to generate the repetitive code.

This will be a breaking change: it overhauls the sampling API in a backwards-incompatible way.
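
For a rough idea of the shape this enables, here is a sketch with made-up variant and field names (not the actual code in this MR): each sampling method carries only the options that apply to it, so the UI never has to show, say, a temperature slider for a greedy sampler.

```rust
// Sketch only: illustrative names, not the code in this branch.
#[allow(dead_code)]
enum SamplerConfig {
    Greedy,
    Temperature { temperature: f32, seed: u32 },
    MirostatV2 { tau: f32, eta: f32, seed: u32 },
}

fn describe(cfg: &SamplerConfig) -> String {
    match cfg {
        SamplerConfig::Greedy => "greedy (no options)".to_string(),
        SamplerConfig::Temperature { temperature, .. } => {
            format!("temperature sampling (temperature={temperature})")
        }
        SamplerConfig::MirostatV2 { tau, eta, .. } => {
            format!("mirostat v2 (tau={tau}, eta={eta})")
        }
    }
}

fn main() {
    let cfg = SamplerConfig::Temperature { temperature: 0.8, seed: 1234 };
    println!("{}", describe(&cfg));
}
```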

@AsbjornOlling (Contributor, Author) commented:

I wrote some low-effort macros. I think it's not too bad.

I also implemented all of the available llama.cpp samplers.

The only one I didn't add was "Infill". I don't really understand it, and it seems like there aren't Rust bindings for it.
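
For a sense of what the macros save, here is a sketch in the same spirit (purely illustrative; the actual macros in this branch differ): a `macro_rules!` macro that stamps out one constructor function per sampler variant.

```rust
// Illustrative sketch only, not the macros actually in this branch.
enum SamplerConfig {
    Temperature { temperature: f32, seed: u32 },
    MirostatV2 { tau: f32, eta: f32, seed: u32 },
}

// Generates one constructor function per listed variant.
macro_rules! sampler_constructors {
    ($($fn_name:ident => $variant:ident { $($field:ident : $ty:ty),* $(,)? }),* $(,)?) => {
        $(
            pub fn $fn_name($($field: $ty),*) -> SamplerConfig {
                SamplerConfig::$variant { $($field),* }
            }
        )*
    };
}

sampler_constructors! {
    temperature => Temperature { temperature: f32, seed: u32 },
    mirostat_v2 => MirostatV2 { tau: f32, eta: f32, seed: u32 },
}

fn main() {
    // Both constructors below were generated by the macro.
    let _t = temperature(0.8, 1234);
    let _m = mirostat_v2(5.0, 0.1, 1234);
}
```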

@AsbjornOlling (Contributor, Author) commented:

It would be nice to group the "penalty" parameters together in some way.
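
One possible shape for that grouping (hypothetical names and defaults, loosely modeled on llama.cpp's penalty parameters): bundle the penalty knobs into one struct that any sampler variant can embed as a unit, instead of four loose fields each.

```rust
// Hypothetical sketch: field names and defaults are illustrative.
#[derive(Clone, Copy, Debug)]
struct PenaltyConfig {
    last_n: i32,  // how many recent tokens the penalties consider
    repeat: f32,  // multiplicative repetition penalty (1.0 = disabled)
    freq: f32,    // frequency penalty (0.0 = disabled)
    present: f32, // presence penalty (0.0 = disabled)
}

impl Default for PenaltyConfig {
    fn default() -> Self {
        // Defaults in the spirit of llama.cpp's: penalties effectively off.
        PenaltyConfig { last_n: 64, repeat: 1.0, freq: 0.0, present: 0.0 }
    }
}

fn main() {
    println!("{:?}", PenaltyConfig::default());
}
```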

@AsbjornOlling changed the title from "Draft: Sampler variants" to "Sampler variants" on Dec 17, 2024
@emilnorsker (Contributor) commented:

 ERROR: Model worker crashed: Lama.cpp failed fetching chat template: the model has no meta val - returned code -1
   at: <nobodywho::NobodyWhoChat as godot_core::gen::classes::node::re_export::INode>::physics_process (src\lib.rs:117)
ERROR: Model output channel died. Did the LLM worker crash?
   at: <nobodywho::NobodyWhoChat as godot_core::gen::classes::node::re_export::INode>::physics_process (src\lib.rs:123)
ERROR: Model worker crashed: Lama.cpp failed fetching chat template: the model has no meta val - returned code -1
   at: <nobodywho::NobodyWhoChat as godot_core::gen::classes::node::re_export::INode>::physics_process (src\lib.rs:117)
ERROR: Model output channel died. Did the LLM worker crash?
   at: <nobodywho::NobodyWhoChat as godot_core::gen::classes::node::re_export::INode>::physics_process (src\lib.rs:123)

I am getting this error after changing to this branch. The only model I've found that it works with is Gemma 2 2B.

I have tried the following models:

  • Llama 2 7B Q4
  • Llama 3.2 3B Q2

Note: this is after upgrading the plugin in an existing repository. I will try to replicate in a new Godot project as well, and I'm also downloading a non-Llama model to check whether the issue is specific to Llama models.

@emilnorsker (Contributor) commented:

Alright, I tested two more models, and both work, so it looks like it's only Llama models that fail... The error persists on a clean project.

@emilnorsker (Contributor) commented:

I also tested the different parameters on the greedy sampler:

  • penalty last n = 10 => gave the same token every time, which means it works.
  • penalty repeat = -100, 100, 10 => gave the same token several times and kept the exact same output. It looks like something here is broken?
  • penalty freq = 1000 => no changes; same goes for -1000.
  • Same results with the rest of the parameters.

@AsbjornOlling (Contributor, Author) commented:

> ERROR: Model worker crashed: Lama.cpp failed fetching chat template: the model has no meta val - returned code -1
>
> I am getting this error after changing to this branch. The only model I've found that it works with is Gemma 2 2B. [...]
>
> Note: this is after upgrading the plugin in an existing repository. [...]

This error appears on the main branch as well as on this branch, for the Llama 2 model mentioned. I think the issue is that old GGUF files don't have all of the metadata fields that newer llama.cpp versions expect. Either way, it has nothing to do with the current MR.
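
For what it's worth, a defensive fallback could look like the sketch below; the types, function names, and the fallback template string are all placeholders, not the real bindings API:

```rust
// Placeholder sketch, not the actual llama.cpp bindings API: prefer the
// chat template from the GGUF metadata, and fall back to a hardcoded one
// when the metadata key is missing (as in old Llama 2 conversions).
const FALLBACK_TEMPLATE: &str = "<s>[INST] {prompt} [/INST]"; // assumed Llama 2 style

fn chat_template_or_fallback(meta_template: Option<&str>) -> String {
    match meta_template {
        Some(tmpl) => tmpl.to_string(),
        None => {
            eprintln!("warning: model has no chat template metadata; using fallback");
            FALLBACK_TEMPLATE.to_string()
        }
    }
}

fn main() {
    // An old GGUF with no tokenizer.chat_template metadata:
    println!("{}", chat_template_or_fallback(None));
}
```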

@AsbjornOlling (Contributor, Author) commented:

> I also tested the different parameters on the greedy sampler:
>
> penalty last n = 10 => gave the same token every time, which means it works. [...]

My best guess is that you got weird results because you used values far outside the sensible ranges. The repeat penalty, for example, is multiplicative: 1.0 means disabled, and negative values aren't meaningful.

Have a look at the llama.cpp docs for a bit more explanation: https://github.com/ggerganov/llama.cpp/blob/master/examples/main/README.md

It might be worthwhile to set explicit ranges for the sampler config values, to give people an idea of which values are sane.
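
For instance, something along these lines; the bounds are rough guesses, not values taken from llama.cpp, and should be checked against the docs before copying:

```rust
// Sketch of clamping sampler config values into rough "sane" ranges.
fn clamp_penalty_repeat(v: f32) -> f32 {
    // Multiplicative penalty: 1.0 disables it; much above ~1.5 degrades output.
    v.clamp(1.0, 1.5)
}

fn clamp_penalty_freq(v: f32) -> f32 {
    // Additive penalty: 0.0 disables it.
    v.clamp(-2.0, 2.0)
}

fn main() {
    // The extreme values from the test above get pulled back into range:
    assert_eq!(clamp_penalty_repeat(-100.0), 1.0);
    assert_eq!(clamp_penalty_freq(1000.0), 2.0);
    println!("clamped ok");
}
```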
