
Add use_exact_model_name option to prevent automatic model name modification #1339

Open · wants to merge 1 commit into main
Conversation

niryuu commented Nov 26, 2024

When loading quantized models, Unsloth automatically rewrites the model name so that an optimized version is loaded instead. While this is helpful in most cases, it can lead to duplicate model caching when users specifically want to load both the original and the quantized version of a model.

This PR adds a new use_exact_model_name parameter that allows users to bypass this automatic modification and load the exact model specified.

Example:

from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    "google/gemma-2-9b",
    load_in_4bit=True,
    use_exact_model_name=True,  # load exactly this model, without name remapping
)

danielhanchen (Contributor) commented:
So the goal is instead of loading google/gemma-2-9b-bnb-4bit, it should load directly from the cache of google/gemma-2-9b?

niryuu (Author) commented Nov 26, 2024

Yes. Unsloth has a mechanism that rewrites the given model name according to unsloth/models/mapper.py for efficiency. However, there are cases where we want to use the exact model name, such as for cache control. The purpose of this PR is to provide an option for that case.
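The remapping-plus-bypass idea can be sketched as follows. This is a minimal illustration, not Unsloth's actual implementation: the `MODEL_MAPPER` dict and `resolve_model_name` function are hypothetical stand-ins for the logic in unsloth/models/mapper.py, and the single mapping entry uses the remapped name mentioned earlier in this thread.

```python
# Hypothetical sketch of the name-remapping bypass discussed above.
# MODEL_MAPPER and resolve_model_name are illustrative, not Unsloth's API.

MODEL_MAPPER = {
    # example entry taken from the discussion in this PR
    "google/gemma-2-9b": "google/gemma-2-9b-bnb-4bit",
}

def resolve_model_name(model_name: str, use_exact_model_name: bool = False) -> str:
    """Return the model name that would actually be loaded from the hub/cache."""
    if use_exact_model_name:
        # Bypass the mapper entirely: load exactly what the caller asked for,
        # so the cache is not populated with a second, remapped copy.
        return model_name
    # Default behavior: substitute an optimized variant when one is known.
    return MODEL_MAPPER.get(model_name, model_name)

print(resolve_model_name("google/gemma-2-9b"))
# -> google/gemma-2-9b-bnb-4bit
print(resolve_model_name("google/gemma-2-9b", use_exact_model_name=True))
# -> google/gemma-2-9b
```

With the flag set, the requested name passes through untouched, which keeps the Hugging Face cache keyed on the exact identifier the user supplied.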
