Skip to content

Latest commit

 

History

History

stress-test

Stress test

T5 on RTX 3090

Rate of 5 users per second, total requests up to 50 users for 60 seconds,

locust -f t5.py -P 7001 -H http://localhost:7088 -r 5 -u 50 -t 60

alt text

Llama 3.2 1B Instruct on RTX 3090

Rate of 5 users per second, total requests up to 50 users for 60 seconds,

locust -f llama3.2-1b.py -P 7001 -H http://localhost:7088 -r 5 -u 50 -t 60

alt text

Whisper Large V3 on RTX 3090

Rate of 5 users per second, total requests up to 30 users for 60 seconds,

locust -f whisper.py -P 7001 -H http://localhost:7088 -r 5 -u 30 -t 60

alt text