what is the practical speedup ? #5

XA23i · 2024-01-04T13:15:58Z

interesting work,
Since some salient parameters have not been binarized, I am curious about the practical speedup in comparison to floating-point models. Do you utilize some GPU kernel to accelerate inference?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

what is the practical speedup ? #5

what is the practical speedup ? #5

XA23i commented Jan 4, 2024

what is the practical speedup ? #5

what is the practical speedup ? #5

Comments

XA23i commented Jan 4, 2024