[WIP] porting Medusa #1213

zhyncs · 2024-02-28T16:12:15Z

Motivation

As titled, porting FasterDecoding/Medusa

Modification

finished

1、Medusa weights conversion
2、Medusa weights loading
3、Porting FasterDecoding/Medusa Heads code with LMDeploy components and utilities
4、TP support: Distribute the weights equally based on hidden_size

We've used https://github.com/zhyncs/medusa-whl-centos7/releases/tag/2024.02.27, https://huggingface.co/FasterDecoding/medusa-vicuna-13b-v1.3, https://huggingface.co/lmsys/vicuna-13b-v1.3 to verify the correctness of porting code (fp16 and bf16).

under debugging

1、Porting generate_candidates and evaluate_posterior
2、Integrating with LlamaBatch

todo

1、add docs
2、add tests
3、benchmark

Checklist

Pre-commit or other linting tools are used to fix the potential lint issues.
The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
The documentation has been modified accordingly, like docstring or example tutorials.

zhyncs · 2024-03-01T15:14:06Z

After creating a new commit, the updates I made are not showing up in this PR. I believe the changes were submitted correctly. It appears to be a bug on GitHub, so I will close this PR and open a new one.

zhyncs · 2024-03-01T15:19:05Z

refer to #1231 just close this

zhyncs · 2024-03-01T15:29:49Z

https://www.githubstatus.com/

It is indeed a problem with GitHub.

zhyncs added 2 commits February 28, 2024 23:53

feat: support fused_bias_residual_activation for medusa

377b564

feat: porting medusa head and resblock

66460a1

zhyncs mentioned this pull request Feb 28, 2024

feat: support fused_bias_residual_activation for medusa #1199

Closed

fix lint

5bcbf18

zhyncs mentioned this pull request Feb 29, 2024

[Feature] Medusa weights conversion #1180

Closed

5 tasks

zhyncs added 4 commits March 1, 2024 11:22

feat: update fused_bias_residual_activation for tp

e2cd9be

feat: update medusa weight for tp

75f9f1c

feat: tp support

3fcb18c

fix lint

106b734

zhyncs closed this Mar 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] porting Medusa #1213

[WIP] porting Medusa #1213

zhyncs commented Feb 28, 2024 •

edited

Loading

zhyncs commented Mar 1, 2024

zhyncs commented Mar 1, 2024 •

edited

Loading

zhyncs commented Mar 1, 2024

[WIP] porting Medusa #1213

[WIP] porting Medusa #1213

Conversation

zhyncs commented Feb 28, 2024 • edited Loading

Motivation

Modification

finished

under debugging

todo

Checklist

zhyncs commented Mar 1, 2024

zhyncs commented Mar 1, 2024 • edited Loading

zhyncs commented Mar 1, 2024

zhyncs commented Feb 28, 2024 •

edited

Loading

zhyncs commented Mar 1, 2024 •

edited

Loading