Skip to content

Commit f54eba2

Browse files
committed
More updates
Signed-off-by: Frank Di Natale <[email protected]>
1 parent 8d368ce commit f54eba2

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

docs/source/commands/trtllm-bench.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -215,7 +215,7 @@ Run latency benchmark:
215215

216216
.. code-block:: bash
217217
218-
trtllm-bench -m <model_name> --engine_dir <engine_path> --kv_cache_free_gpu_mem_fraction <kv_cache_free_gpu_mem_fraction> --dataset <dataset_path> --num_requests <num_requests> --warmup <warmup> --concurrency <concurrency> --beam_width <beam_width> --medusa_choices <medusa_choices> --report_json <report_json> --iteration_log <iteration_log>
218+
trtllm-bench -m <model_name> latency --engine_dir <engine_path> --dataset <dataset_path>
219219
220220
Dataset Preparation
221221
------------------
@@ -336,7 +336,7 @@ trtllm-bench is designed to work with the ``prepare_dataset.py`` script, which g
336336

337337
.. code-block:: bash
338338
339-
python prepare_dataset.py --tokenizer meta-llama/Meta-Llama-3.3-8B --stdout dataset --output benchmark_data.jsonl
339+
python prepare_dataset.py --tokenizer <model_name> --stdout dataset --output benchmark_data.jsonl
340340
341341
This workflow allows you to:
342342

0 commit comments

Comments
 (0)