docs: readme

Signed-off-by: thxCode <[email protected]>
gpustack · Aug 29, 2024 · 9f94118
1 parent 269d4ba
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/README.md b/README.md
@@ -793,7 +793,7 @@ $ gguf-parser --hf-repo="QuantFactory/Meta-Llama-3-8B-Instruct-GGUF" --hf-file="
 Use `--gpu-layers-step` to get the proper offload layers number when the model is too large to fit into the GPUs memory.
 
 ```shell
-$ gguf-parser --hf-repo="etemiz/Llama-3.1-405B-Inst-GGUF" --hf-file="llama-3.1-405b-IQ1_M-00019-of-00019.gguf" --skip-metadata --skip-architecture --skip-tokenizer --gpu-layers-step=10 --in-short
+$ gguf-parser --hf-repo="etemiz/Llama-3.1-405B-Inst-GGUF" --hf-file="llama-3.1-405b-IQ1_M-00019-of-00019.gguf" --skip-metadata --skip-architecture --skip-tokenizer --gpu-layers-step=1 --in-short
 +------------------------------------------------------------------------------------------------------+
 | ESTIMATE                                                                                             |
 +----------------+----------------+-----------------------------------+--------------------------------+