[Speculative decoding] Support num_of_seqs
in scheduler and extend CB Benchmark by SD parameters
#3429
Job | Run time |
---|---|
32m 37s | |
30m 32s | |
1s | |
1h 3m 10s |