tdoublep changed the title from "Enable ITL, TTFT computation using mean rather than median" to "Enable ITL, TTFT, E2E latency computation using mean rather than median" on Jun 19, 2024
Let's make this configurable in the parser, and maybe even make mean the default.
Median does not make sense with speculative decoding anyway.
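
For illustration, here is a rough sketch of what a configurable aggregation could look like. The names `AGGREGATORS` and `aggregate_latency` are hypothetical and not taken from the actual parser, and the sample latencies are made up. They assume that with speculative decoding several tokens can arrive together, so most inter-token gaps are close to zero.

```python
import statistics
from typing import Callable, Dict, Sequence

# Hypothetical registry of aggregation methods; not the real parser's API.
AGGREGATORS: Dict[str, Callable[[Sequence[float]], float]] = {
    "mean": statistics.fmean,
    "median": statistics.median,
}


def aggregate_latency(samples: Sequence[float], method: str = "mean") -> float:
    """Aggregate latency samples (ITL, TTFT, or E2E) with the chosen method."""
    if method not in AGGREGATORS:
        raise ValueError(f"unknown method {method!r}, expected one of {sorted(AGGREGATORS)}")
    if not samples:
        raise ValueError("no latency samples to aggregate")
    return AGGREGATORS[method](samples)


# Made-up inter-token latencies in ms: bursts of tokens arriving together
# (gap of 0) separated by one longer step each.
itl_ms = [0.0, 0.0, 0.0, 40.0, 0.0, 0.0, 0.0, 40.0]

print("mean:", aggregate_latency(itl_ms, "mean"))
print("median:", aggregate_latency(itl_ms, "median"))
```

On this made-up data the median collapses to zero while the mean still reflects the time spent per token, which is the kind of case where mean is the more useful default.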