modellist: automatically replace known chat templates with our versions #3327

cebtenzzre · 2024-12-19T19:26:06Z

This is a temporary fix for issues such as #3309, #3282, and #3263 until we can improve Jinja2Cpp itself.

The workaround for users loading a GGUF they downloaded from HuggingFace and then getting an ugly chat template that is in many cases incompatible with Jinja2Cpp, is to make a list of known templates and automatically replace them when found.

This is currently done silently and transparently, aside from a log message that looks like this:

[Warning] (Thu Dec 19 14:25:09 2024): automatically substituting chat template for "SummLlama3.2-3B-Q4_0.gguf"

We can make this list less necessary by improving Jinja2Cpp (see the comments in jinja_replacements.cpp for next steps), but for now this is better than what the current release does with these models.

This allows us to be compatible with e.g. any finetune of Llama 3.2 3B Instruct that does not alter the chat template, without having to touch models3.json.

models3.json has been touched in this PR for consistency with the list of substitutions, which now covers all official GPT4All models, even the ones that were working before. This makes them prettier and easier to edit.

Signed-off-by: Jared Van Bortel <[email protected]>

Signed-off-by: AT <[email protected]>

cebtenzzre added 2 commits December 19, 2024 14:25

modellist: automatically replace known chat templates with our versions

2fa23db

Signed-off-by: Jared Van Bortel <[email protected]>

changelog: add this PR

804541a

Signed-off-by: Jared Van Bortel <[email protected]>

cebtenzzre marked this pull request as ready for review December 19, 2024 19:33

cebtenzzre requested a review from manyoso December 19, 2024 19:33

add Llama-3.3-70B-Instruct-Q4_0.gguf

71be437

Signed-off-by: Jared Van Bortel <[email protected]>

cebtenzzre added 2 commits December 19, 2024 14:53

add calme-2.1-phi3.5-4b.Q6_K.gguf

3227ebe

Signed-off-by: Jared Van Bortel <[email protected]>

mention more equivalent models

c1e6d03

Signed-off-by: Jared Van Bortel <[email protected]>

manyoso approved these changes Dec 19, 2024

View reviewed changes

Merge branch 'main' into jinja-substitutions

c878a94

manyoso mentioned this pull request Dec 19, 2024

Start PR for known working chat templates. #3290

Closed

cebtenzzre and others added 2 commits December 19, 2024 16:29

Merge branch 'main' into jinja-substitutions

02972db

Merge branch 'main' into jinja-substitutions

1f203ed

Signed-off-by: AT <[email protected]>

manyoso merged commit 6bbeac2 into main Dec 19, 2024
3 of 4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

modellist: automatically replace known chat templates with our versions #3327

modellist: automatically replace known chat templates with our versions #3327

cebtenzzre commented Dec 19, 2024 •

edited

Loading

modellist: automatically replace known chat templates with our versions #3327

modellist: automatically replace known chat templates with our versions #3327

Conversation

cebtenzzre commented Dec 19, 2024 • edited Loading

cebtenzzre commented Dec 19, 2024 •

edited

Loading