
feat: exclude mamba blocks for jamba when load8bit #1578

Merged
Merged 1 commit on May 7, 2024

Conversation

NanoCode012
Collaborator

Description

Closes #1498


@NanoCode012
Collaborator Author

@creatorrr I made the PR. Please let me know if you can test it and confirm that it works.
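For context, the change keeps Jamba's mamba blocks out of 8-bit quantization. In Hugging Face transformers, this kind of exclusion is typically expressed via `BitsAndBytesConfig(llm_int8_skip_modules=...)`. As a rough, hypothetical sketch (not the PR's actual code), one way to collect the module names to skip:

```python
# Hypothetical sketch: pick out mamba-block module names so they can be
# passed to `llm_int8_skip_modules` and left unquantized under load_in_8bit.

def mamba_skip_modules(module_names):
    """Return the (sorted, de-duplicated) module names that belong to mamba blocks."""
    return sorted({name for name in module_names if "mamba" in name})

# Made-up Jamba-style module names for illustration only.
names = [
    "model.layers.0.mamba.in_proj",
    "model.layers.0.mamba.out_proj",
    "model.layers.1.self_attn.q_proj",
    "model.layers.2.mamba.x_proj",
]
print(mamba_skip_modules(names))
# → ['model.layers.0.mamba.in_proj', 'model.layers.0.mamba.out_proj',
#    'model.layers.2.mamba.x_proj']
```

The resulting list would then be handed to something like `BitsAndBytesConfig(load_in_8bit=True, llm_int8_skip_modules=...)`; the helper name and the module names above are assumptions for illustration.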

@NanoCode012 changed the title from "feat: exclude mamba blocks for jamba" to "feat: exclude mamba blocks for jamba when load8bit" on Apr 30, 2024
@creatorrr

I'd have to do a training run with and without the patch to compare, but the recommendation from the Jamba team is exactly this, so 💯 from me. And thanks a bunch for getting around to this!

@NanoCode012 NanoCode012 merged commit 8b9c15b into axolotl-ai-cloud:main May 7, 2024
7 checks passed
@NanoCode012 NanoCode012 deleted the feat/no-quant-mamba branch May 7, 2024 13:52
djsaunde pushed a commit that referenced this pull request Dec 17, 2024
Successfully merging this pull request may close these issues.

[jamba] Quantizing should exclude mamba layers