
[REQUEST] Support for the new Command-r7b #703

Closed · 3 tasks done · ciprianveg opened this issue Dec 22, 2024 · 7 comments

@ciprianveg
Problem

Hello, could we please get support for the 128k-context Command-r7b? I would like to use it both for fast tool selection based on the user prompt, before calling its bigger Command-r sibling, and as a draft model to accelerate Command-r or Command-r Plus.

Solution

An EXL2 quantization supporting the full 128k context that can also be used as a draft model.

Alternatives

No response

Explanation

Faster Command-r generation and almost instant tool selection.

Examples

No response

Additional context

No response

Acknowledgements

  • I have looked for similar requests before submitting this one.
  • I understand that the developers have lives and my issue will be answered when possible.
  • I understand the developers of this program are human, and I will make my requests politely.
@turboderp
Member

I've added support in the dev branch, or at least an attempt at it. It appears the architecture is identical to Cohere with the exception of SWA on all but every 4th layer.

It seems to be coherent at least up to the model's native 8k context limit. Although the readme mentions a 128k context length, I don't see any hint of that in the model's config or in the HF implementation. Is it supposed to use YaRN?

@ciprianveg
Author

Hi, thank you for your work on this. Could it just be a config default-value issue, as it was for the previous Command-r? https://huggingface.co/CohereForAI/c4ai-command-r-v01/discussions/12

@turboderp
Member

It might be. You can always try giving it a longer max_seq_len to override the default. I tested it a bit as a plain instruct model, using the old Cohere template, and it does eventually turn incoherent, but I'm not sure it's supposed to be used that way; it might be fine producing tool calls from a 100k prompt.

@ciprianveg
Author

I don't know how to build ExLlama for Windows from the dev branch to be able to test... I am using it via tabbyAPI...
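For what it's worth, overriding the context length through tabbyAPI does not require building anything; it is set in tabbyAPI's config.yml. A rough sketch of the relevant section follows. The folder name is hypothetical, and exact key names vary between tabbyAPI versions, so check the sample config.yml shipped with your install:

```yaml
model:
  model_dir: models
  model_name: command-r7b-exl2   # hypothetical folder holding the EXL2 quantization
  max_seq_len: 131072            # override the model's default context, as suggested above
  draft:
    draft_model_name: command-r7b-exl2  # optionally reuse the small model as a draft
```

With a setup along these lines, the 128k override can be tested from tabbyAPI without a local dev-branch build.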

@turboderp
Member

There should be a new release shortly.

@turboderp
Member

0.2.7 is released now if you missed it. Feel free to open another issue if something is still broken.

@ciprianveg
Author

Thank you! I will test it and let you know if there are any issues!
