fix/load-checkpoint-add-new-tokens #1225

Draft: wants to merge 1 commit into base: main

Conversation

Erland366 (Contributor)

#1215

Given this issue, where we can't immediately use the changed vocab size because of the size mismatch between the adapter and the base model, we need to resize the base model before merging the LoRA adapter into it.
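
Roughly, the idea is something like the following. This is only a minimal sketch using plain transformers/PEFT calls; the checkpoint path and base model name are placeholders, not the actual Unsloth code path:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# The tokenizer saved with the LoRA checkpoint already contains the added
# tokens, so the base model's embeddings must be resized to that vocabulary
# size *before* the adapter is attached and merged.
tokenizer = AutoTokenizer.from_pretrained("lora-checkpoint-dir")        # placeholder path
base_model = AutoModelForCausalLM.from_pretrained("base-model-name")    # placeholder name

# Resize first, otherwise the adapter's embedding shapes do not match the base model.
base_model.resize_token_embeddings(len(tokenizer))

model = PeftModel.from_pretrained(base_model, "lora-checkpoint-dir")
model = model.merge_and_unload()
```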

Note that this requires changes to unsloth-zoo, for which I also created a PR:

unslothai/unsloth-zoo#9

@Erland366 (Contributor, Author)

I'd like to discuss the embedding handling, though, since I did not implement a way to specify the method used to extend the embeddings. For example, if the user chose interpolation when training the embeddings, then when we load the checkpoint and resize the base model again, we need to make sure the resize method is the same as in training.

Maybe we can store the method as an additional parameter in model.config? Then we can read it back when we load the checkpoint and resize.
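
Something along these lines, for example. The config key `embedding_resize_method` here is hypothetical, purely to illustrate persisting the method alongside the checkpoint; as far as I know, a custom attribute set on a `PretrainedConfig` gets serialized into config.json by `save_pretrained`:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# --- at training time, after the embeddings were extended ---
model = AutoModelForCausalLM.from_pretrained("base-model-name")   # placeholder
model.config.embedding_resize_method = "interpolation"            # hypothetical key
model.save_pretrained("checkpoint-dir")                           # key is written to config.json

# --- at load time ---
tokenizer = AutoTokenizer.from_pretrained("checkpoint-dir")
reloaded = AutoModelForCausalLM.from_pretrained("checkpoint-dir")
resize_method = getattr(reloaded.config, "embedding_resize_method", "mean")

# Resize to the new vocabulary, then re-initialize the new rows using the
# same method that was recorded at training time.
reloaded.resize_token_embeddings(len(tokenizer))
print(f"New embedding rows should be initialized with: {resize_method}")
```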

@Erland366 changed the title from "Add functionality to update model vocabulary with new tokenizer tokens" to "fix/load-checkpoint-add-new-tokens" on Oct 31, 2024
@Erland366 (Contributor, Author)

Also, while I'm here: it seems the value of tokenizer.vocab_size is unchanged when we call add_new_tokens. Does tokenizer.vocab_size only count non-special tokens? Since we add all of the new tokens as special tokens, would that explain why the attribute value is not increasing?
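
For reference, this is the behaviour I am seeing with a plain Hugging Face tokenizer (the model name is just an example). If I understand it correctly, `vocab_size` reports only the base vocabulary, while `len(tokenizer)` also counts added tokens:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")  # example model

print(tokenizer.vocab_size)   # base vocabulary only
print(len(tokenizer))         # base vocabulary + added tokens

# Added tokens go into the added-tokens table rather than the base
# vocabulary, so vocab_size stays the same after this call.
tokenizer.add_tokens(["<NEW_TOKEN_1>", "<NEW_TOKEN_2>"], special_tokens=True)

print(tokenizer.vocab_size)   # unchanged
print(len(tokenizer))         # increased by 2

# So len(tokenizer) is the number to use for model.resize_token_embeddings(...)
```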
