[Feature] [Spec decode]: Enable MLPSpeculator/Medusa and prompt_logprobs
with ChunkedPrefill
#1877
Job | Run time |
---|---|
2s | |
2s |
prompt_logprobs
with ChunkedPrefill
#1877
Job | Run time |
---|---|
2s | |
2s |