feat(transformers): support also text generation #1630

mudler · 2024-01-23T17:44:25Z

Description

Related to #1126. Up to now this backend was only supporting embeddings.

this PR adds text generation support also with transformers AutoModel/AutoTokenizer.

Notes for Reviewers

Signed commits

Yes, I signed my commits.

Signed-off-by: Ettore Di Giacinto <[email protected]>

netlify · 2024-01-23T17:44:31Z

✅ Deploy Preview for localai canceled.

Name	Link
🔨 Latest commit	`50bc9a4`
🔍 Latest deploy log	https://app.netlify.com/sites/localai/deploys/65affe37b59c00000860b0b7

….0@b689c91 by renovate (#17756) This PR contains the following updates: | Package | Update | Change | |---|---|---| | [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) | minor | `v2.6.1` -> `v2.7.0` | --- > [!WARNING] > Some dependencies could not be looked up. Check the Dependency Dashboard for more information. --- ### Release Notes <details> <summary>mudler/LocalAI (docker.io/localai/localai)</summary> ### [`v2.7.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.7.0) [Compare Source](https://togithub.com/mudler/LocalAI/compare/v2.6.1...v2.7.0)  This release adds support to the transformer backend for LLM as well! For now instance you can run codellama-7b with transformers with: docker run -ti -p 8080:8080 --gpus all localai/localai:v2.7.0-cublas-cuda12 codellama-7b In the quickstart there are more examples available https://localai.io/basics/getting_started/#running-models. Note: As llama.cpp is ongoing with changes that could possible cause breakage, this release does not includes changes from [ggerganov/llama.cpp#5138 (the future versions will). #### What's Changed ##### Bug fixes 🐛 - fix(paths): automatically create paths by [@mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1650](https://togithub.com/mudler/LocalAI/pull/1650) ##### Exciting New Features 🎉 - feat(transformers): support also text generation by [@mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1630](https://togithub.com/mudler/LocalAI/pull/1630) - transformers: correctly load automodels by [@mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1643](https://togithub.com/mudler/LocalAI/pull/1643) - feat(startup): fetch model definition remotely by [@mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1654](https://togithub.com/mudler/LocalAI/pull/1654) ##### 👒 Dependencies - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1642](https://togithub.com/mudler/LocalAI/pull/1642) - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1644](https://togithub.com/mudler/LocalAI/pull/1644) - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1652](https://togithub.com/mudler/LocalAI/pull/1652) - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1655](https://togithub.com/mudler/LocalAI/pull/1655) ##### Other Changes - ⬆️ Update ggerganov/llama.cpp by [@localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1632](https://togithub.com/mudler/LocalAI/pull/1632) - ⬆️ Update docs version mudler/LocalAI by [@localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1631](https://togithub.com/mudler/LocalAI/pull/1631) **Full Changelog**: mudler/LocalAI@v2.6.1...v2.6.2 </details> --- ### Configuration 📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone Europe/Amsterdam, Automerge - At any time (no schedule defined). 🚦 **Automerge**: Enabled. ♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox. 🔕 **Ignore**: Close this PR and you won't be reminded about this update again. --- - [ ] If you want to rebase/retry this PR, check this box --- This PR has been generated by [Renovate Bot](https://togithub.com/renovatebot/renovate).

….0@b689c91 by renovate (truecharts#17756) This PR contains the following updates: | Package | Update | Change | |---|---|---| | [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) | minor | `v2.6.1` -> `v2.7.0` | --- > [!WARNING] > Some dependencies could not be looked up. Check the Dependency Dashboard for more information. --- ### Release Notes <details> <summary>mudler/LocalAI (docker.io/localai/localai)</summary> ### [`v2.7.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.7.0) [Compare Source](https://togithub.com/mudler/LocalAI/compare/v2.6.1...v2.7.0)  This release adds support to the transformer backend for LLM as well! For now instance you can run codellama-7b with transformers with: docker run -ti -p 8080:8080 --gpus all localai/localai:v2.7.0-cublas-cuda12 codellama-7b In the quickstart there are more examples available https://localai.io/basics/getting_started/#running-models. Note: As llama.cpp is ongoing with changes that could possible cause breakage, this release does not includes changes from [ggerganov/llama.cpp#5138 (the future versions will). #### What's Changed ##### Bug fixes 🐛 - fix(paths): automatically create paths by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1650](https://togithub.com/mudler/LocalAI/pull/1650) ##### Exciting New Features 🎉 - feat(transformers): support also text generation by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1630](https://togithub.com/mudler/LocalAI/pull/1630) - transformers: correctly load automodels by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1643](https://togithub.com/mudler/LocalAI/pull/1643) - feat(startup): fetch model definition remotely by [@&truecharts#8203;mudler](https://togithub.com/mudler) in [https://github.com/mudler/LocalAI/pull/1654](https://togithub.com/mudler/LocalAI/pull/1654) ##### 👒 Dependencies - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1642](https://togithub.com/mudler/LocalAI/pull/1642) - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1644](https://togithub.com/mudler/LocalAI/pull/1644) - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1652](https://togithub.com/mudler/LocalAI/pull/1652) - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1655](https://togithub.com/mudler/LocalAI/pull/1655) ##### Other Changes - ⬆️ Update ggerganov/llama.cpp by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1632](https://togithub.com/mudler/LocalAI/pull/1632) - ⬆️ Update docs version mudler/LocalAI by [@&truecharts#8203;localai-bot](https://togithub.com/localai-bot) in [https://github.com/mudler/LocalAI/pull/1631](https://togithub.com/mudler/LocalAI/pull/1631) **Full Changelog**: mudler/LocalAI@v2.6.1...v2.6.2 </details> --- ### Configuration 📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone Europe/Amsterdam, Automerge - At any time (no schedule defined). 🚦 **Automerge**: Enabled. ♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox. 🔕 **Ignore**: Close this PR and you won't be reminded about this update again. --- - [ ] If you want to rebase/retry this PR, check this box --- This PR has been generated by [Renovate Bot](https://togithub.com/renovatebot/renovate).

feat(transformers): support also text generation

389e912

Signed-off-by: Ettore Di Giacinto <[email protected]>

embedded: set seed -1

50bc9a4

mudler merged commit 5e335ea into master Jan 23, 2024
24 checks passed

mudler deleted the transformers_automodel branch January 23, 2024 22:07

mudler added the enhancement New feature or request label Jan 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(transformers): support also text generation #1630

feat(transformers): support also text generation #1630

mudler commented Jan 23, 2024 •

edited

Loading

netlify bot commented Jan 23, 2024 •

edited

Loading

feat(transformers): support also text generation #1630

feat(transformers): support also text generation #1630

Conversation

mudler commented Jan 23, 2024 • edited Loading

netlify bot commented Jan 23, 2024 • edited Loading

✅ Deploy Preview for localai canceled.

mudler commented Jan 23, 2024 •

edited

Loading

netlify bot commented Jan 23, 2024 •

edited

Loading