Deepspeed-FastGen: Missing Precision Evaluation #4618

m1mc · 2023-11-04T08:51:23Z


The existing documentation for the new SplitFuse technique says that a long prompt is decomposed into smaller pieces so that token generation can start earlier in the forward passes. For pretrained LLMs, say from Hugging Face, I think this decomposition will degrade precision unless the LLM was also trained with SplitFuse, won't it? I could not find any precision evaluation in the docs.
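To make the concern concrete, here is a toy check one could run. It is only a sketch under stated assumptions, not DeepSpeed code: a single-head causal self-attention layer in NumPy, with random weights and no positional encoding, evaluated once over the whole prompt and once chunk by chunk with a growing key/value cache. The function names (`causal_attention`, `full_prefill`, `chunked_prefill`) and the sizes are made up for illustration.

```python
import numpy as np

def causal_attention(q, k, v, q_offset=0):
    """Single-head scaled dot-product attention with a causal mask.

    q holds queries for positions q_offset .. q_offset + len(q) - 1;
    k and v hold keys/values for positions 0 .. len(k) - 1.
    """
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    q_pos = np.arange(q.shape[0])[:, None] + q_offset
    k_pos = np.arange(k.shape[0])[None, :]
    # A query may only attend to keys at the same or earlier positions.
    scores = np.where(k_pos <= q_pos, scores, -np.inf)
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

def full_prefill(x, wq, wk, wv):
    # Process the whole prompt in one forward pass.
    return causal_attention(x @ wq, x @ wk, x @ wv)

def chunked_prefill(x, wq, wk, wv, chunk_size):
    # Process the prompt in pieces, appending keys/values to a cache.
    k_cache = np.empty((0, wk.shape[1]))
    v_cache = np.empty((0, wv.shape[1]))
    outputs = []
    for start in range(0, x.shape[0], chunk_size):
        chunk = x[start:start + chunk_size]
        k_cache = np.concatenate([k_cache, chunk @ wk])
        v_cache = np.concatenate([v_cache, chunk @ wv])
        outputs.append(
            causal_attention(chunk @ wq, k_cache, v_cache, q_offset=start)
        )
    return np.concatenate(outputs)

# Hypothetical toy sizes; the prompt length is deliberately not a
# multiple of the chunk size.
rng = np.random.default_rng(0)
seq_len, d_model = 37, 16
x = rng.standard_normal((seq_len, d_model))
wq, wk, wv = (rng.standard_normal((d_model, d_model)) for _ in range(3))

full = full_prefill(x, wq, wk, wv)
chunked = chunked_prefill(x, wq, wk, wv, chunk_size=8)
max_abs_diff = np.abs(full - chunked).max()
print("max abs difference:", max_abs_diff)
```

In this toy setting the two results should agree up to floating-point rounding, because under a causal mask each position only looks at earlier positions, which the cache already holds. Whether that carries over unchanged to a real multi-layer model and to the actual FastGen kernels is exactly what I would like to see evaluated.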


Replies: 0 comments
