Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Refactor benchmark_throughput.py to pass TextPrompt instead of string
* This is preparation to support multi-modality input, by reusing existing TextPrompt structure * no significant metrics diff, see below - before: Throughput: 13.99 requests/s, 2933.11 total tokens/s, 2758.10 output tokens/s - after: Throughput: 13.99 requests/s, 2932.69 total tokens/s, 2757.70 output tokens/s - test command: `python benchmarks/benchmark_throughput.py --model mistral-community/pixtral-12b --max-model-len=8192 --dataset ../sharegpt4v_instruct_gpt4-vision_cap100k.json`
- Loading branch information