Commit

Config: Update Q4 in comments
Wasn't present when the option was added.

Signed-off-by: kingbri <[email protected]>
kingbri1 committed Mar 17, 2024
1 parent 14d8ec2 commit 7abbac0
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion config_sample.yml
@@ -103,7 +103,8 @@ model:
# Disable Flash-attention 2. Set to True for GPUs lower than Nvidia's 3000 series. (default: False)
#no_flash_attention: False

- # Enable 8 bit cache mode for VRAM savings (slight performance hit). Possible values FP16, FP8. (default: FP16)
+ # Enable 8 bit cache mode for VRAM savings (slight performance hit).
+ # Possible values FP16, FP8, Q4. (default: FP16)
#cache_mode: FP16

# Set the prompt template for this model. If empty, chat completions will be disabled. (default: Empty)
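
For illustration, a minimal sketch of how the option might look once uncommented in a local copy of config_sample.yml. It assumes the key is nested under `model:`, as the hunk header suggests, and that two-space indentation is used. The value Q4 comes from the updated comment; choosing it here is only an example, not a recommendation from the commit.

```yaml
model:
  # Assumption for illustration: pick the Q4 cache listed in the updated comment.
  # Leaving cache_mode commented out keeps the documented default (FP16).
  cache_mode: Q4
```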