Commit
Update vllm's lm-format-enforcer benchmark
1 parent 40f4632, commit 7100cfd
Showing 5 changed files with 113 additions and 53 deletions.
@@ -1,12 +1,24 @@
-llama3_8b_vllm_address generated 1085 tokens with 40.81784246815824 tps (with warm up)
-llama3_8b_vllm_address unconstrained generated 1526 tokens with 41.941802458298596 tps
-llama3_8b_linkedlist generated 1252 tokens with 40.55915109806643 tps (with warm up)
-llama3_8b_linkedlist unconstrained generated 1389 tokens with 41.85204307258889 tps
-llama3_8b_orders generated 4266 tokens with 40.055533063427156 tps (with warm up)
-llama3_8b_orders unconstrained generated 4595 tokens with 41.46372280395214 tps
-llama2_7b_vllm_address generated 1956 tokens with 46.56639728880725 tps (with warm up)
-llama2_7b_vllm_address unconstrained generated 1723 tokens with 47.53091890475664 tps
-llama2_7b_linkedlist generated 1590 tokens with 46.51200582753253 tps (with warm up)
-llama2_7b_linkedlist unconstrained generated 1915 tokens with 47.54641610018721 tps
-llama2_7b_orders generated 5073 tokens with 45.7052185686332 tps (with warm up)
-llama2_7b_orders unconstrained generated 5120 tokens with 46.68000079667531 tps
+formatron_llama3_8b_address generated 837 tokens with 40.92543858162237 tps (with warm up)
+formatron_llama3_8b_address unconstrained generated 1517 tokens with 42.03973057981289 tps
+lm_format_enforcer_llama3_8b_address generated 900 tokens with 38.382952379822854 tps (with warm up)
+lm_format_enforcer_llama3_8b_address unconstrained generated 1557 tokens with 41.95753842260187 tps
+formatron_llama3_8b_linkedlist generated 1253 tokens with 40.65510144702983 tps (with warm up)
+formatron_llama3_8b_linkedlist unconstrained generated 1280 tokens with 41.96170843472007 tps
+lm_format_enforcer_llama3_8b_linkedlist generated 1046 tokens with 41.49508056791946 tps (with warm up)
+lm_format_enforcer_llama3_8b_linkedlist unconstrained generated 1370 tokens with 41.93690985976558 tps
+formatron_llama3_8b_orders generated 3846 tokens with 40.30895091864163 tps (with warm up)
+formatron_llama3_8b_orders unconstrained generated 4577 tokens with 41.55286165339125 tps
+lm_format_enforcer_llama3_8b_order generated 3767 tokens with 40.31444158325534 tps (with warm up)
+lm_format_enforcer_llama3_8b_order unconstrained generated 4581 tokens with 41.55670827797047 tps
+formatron_llama2_7b_address generated 1353 tokens with 46.79902509681338 tps (with warm up)
+formatron_llama2_7b_address unconstrained generated 2000 tokens with 47.6050425184716 tps
+lm_format_enforcer_llama2_7b_address generated 1614 tokens with 46.85370459223725 tps (with warm up)
+lm_format_enforcer_llama2_7b_address unconstrained generated 2000 tokens with 47.591727879190344 tps
+formatron_llama2_7b_linkedlist generated 1022 tokens with 46.64185711280541 tps (with warm up)
+formatron_llama2_7b_linkedlist unconstrained generated 2000 tokens with 47.655055230835984 tps
+lm_format_enforcer_llama2_7b_linkedlist generated 1432 tokens with 46.83670664952844 tps (with warm up)
+lm_format_enforcer_llama2_7b_linkedlist unconstrained generated 2000 tokens with 47.650351474620415 tps
+formatron_llama2_7b_orders generated 4596 tokens with 45.86432176919425 tps (with warm up)
+formatron_llama2_7b_orders unconstrained generated 5120 tokens with 46.742264169348076 tps
+lm_format_enforcer_llama2_7b_orders generated 4834 tokens with 45.98683334304331 tps (with warm up)
+lm_format_enforcer_llama2_7b_orders unconstrained generated 5021 tokens with 46.744708931481135 tps
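
The log lines above come in constrained/unconstrained pairs, so the throughput cost of constrained decoding can be read off per benchmark. A minimal sketch of that comparison follows, assuming the line format shown in this diff; `parse_results` and `overhead_percent` are hypothetical helper names for illustration and are not part of the benchmark script in this repository.

```python
import re

# Matches lines such as
#   "<name> generated 900 tokens with 38.38 tps (with warm up)"
#   "<name> unconstrained generated 1557 tokens with 41.95 tps"
# An optional leading diff marker (+ or -) is tolerated.
LINE_RE = re.compile(
    r"^[+-]?\s*(?P<name>\S+?)(?P<unc> unconstrained)? generated "
    r"(?P<tokens>\d+) tokens with (?P<tps>[0-9.]+) tps"
)


def parse_results(text):
    """Map benchmark name -> {"constrained": tps, "unconstrained": tps}."""
    results = {}
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # skip hunk headers and anything else that is not a result line
        kind = "unconstrained" if match.group("unc") else "constrained"
        results.setdefault(match.group("name"), {})[kind] = float(match.group("tps"))
    return results


def overhead_percent(entry):
    """Relative tps loss of the constrained run versus the unconstrained run."""
    return (1.0 - entry["constrained"] / entry["unconstrained"]) * 100.0


if __name__ == "__main__":
    # Two lines copied from the log above.
    sample = (
        "lm_format_enforcer_llama3_8b_address generated 900 tokens with "
        "38.382952379822854 tps (with warm up)\n"
        "lm_format_enforcer_llama3_8b_address unconstrained generated 1557 tokens with "
        "41.95753842260187 tps\n"
    )
    for name, entry in parse_results(sample).items():
        print(f"{name}: {overhead_percent(entry):.2f}% slower when constrained")
```

The sketch compares tokens per second only. The token counts differ between the paired runs, so the two rates are not measured over identical outputs.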