Skip to content

RFC: FP8 Quantization Schema in vLLM update #5802

HaiShaw started this conversation in Ideas
Jun 24, 2024 · 2 comments · 4 replies
Discussion options

You must be logged in to vote

Replies: 2 comments 4 replies

Comment options

You must be logged in to vote
3 replies
@HaiShaw
Comment options

@mgoin
Comment options

mgoin Jun 25, 2024
Collaborator Sponsor

@comaniac
Comment options

Comment options

You must be logged in to vote
1 reply
@HaiShaw
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Ideas
Labels
None yet
4 participants