Skip to content

[Spec Decode] Disable Log Prob serialization to CPU for spec decoding for both draft and target models. #68

[Spec Decode] Disable Log Prob serialization to CPU for spec decoding for both draft and target models.

[Spec Decode] Disable Log Prob serialization to CPU for spec decoding for both draft and target models. #68