Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deprecation of old backends #1651

Closed
mudler opened this issue Jan 26, 2024 · 0 comments · Fixed by #1689
Closed

Deprecation of old backends #1651

mudler opened this issue Jan 26, 2024 · 0 comments · Fixed by #1689
Labels
enhancement New feature or request

Comments

@mudler
Copy link
Owner

mudler commented Jan 26, 2024

Is your feature request related to a problem? Please describe.
There are several backends that would be legacy by now, as llama.cpp enhanced support for different architectures via ggml over time.

Some of them include falcon-ggml and dolly for instance.

This card is about removing support for old backends, not for removing support family (for instance, starcoder is supported by llama.cpp, so no need to have a starcoder backend based out of ggml).

Tracked in #1126

@mudler mudler added the enhancement New feature or request label Jan 26, 2024
@mudler mudler pinned this issue Jan 26, 2024
@mudler mudler unpinned this issue Feb 11, 2024
truecharts-admin referenced this issue in truecharts/public Feb 12, 2024
….0 by renovate (#18178)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-cublas-cuda11-ffmpeg-core` ->
`v2.8.0-cublas-cuda11-ffmpeg-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-cublas-cuda11-core` -> `v2.8.0-cublas-cuda11-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-cublas-cuda12-ffmpeg-core` ->
`v2.8.0-cublas-cuda12-ffmpeg-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-cublas-cuda12-core` -> `v2.8.0-cublas-cuda12-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0-ffmpeg-core` -> `v2.8.0-ffmpeg-core` |
| [docker.io/localai/localai](https://togithub.com/mudler/LocalAI) |
minor | `v2.7.0` -> `v2.8.0` |

---

> [!WARNING]
> Some dependencies could not be looked up. Check the Dependency
Dashboard for more information.

---

### Release Notes

<details>
<summary>mudler/LocalAI (docker.io/localai/localai)</summary>

### [`v2.8.0`](https://togithub.com/mudler/LocalAI/releases/tag/v2.8.0)

[Compare
Source](https://togithub.com/mudler/LocalAI/compare/v2.7.0...v2.8.0)

This release adds support for Intel GPUs, and it deprecates old
ggml-based backends which are by now superseded by llama.cpp (that now
supports more architectures out-of-the-box). See also
[https://github.com/mudler/LocalAI/issues/1651](https://togithub.com/mudler/LocalAI/issues/1651).

Images are now based on Ubuntu 22.04 LTS instead of Debian bullseye.

##### Intel GPUs

There are now images tagged with "sycl". There are sycl-f16 and sycl-f32
images indicating f16 or f32 support.

For example, to start phi-2 with an Intel GPU it is enough to use the
container image like this:

docker run -e DEBUG=true -ti -v $PWD/models:/build/models -p 8080:8080
-v /dev/dri:/dev/dri --rm
quay.io/go-skynet/local-ai:master-sycl-f32-ffmpeg-core phi-2

##### What's Changed

##### Exciting New Features 🎉

- feat(sycl): Add support for Intel GPUs with sycl
([#&#8203;1647](https://togithub.com/mudler/LocalAI/issues/1647)) by
[@&#8203;mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1660](https://togithub.com/mudler/LocalAI/pull/1660)
- Drop old falcon backend (deprecated) by
[@&#8203;mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1675](https://togithub.com/mudler/LocalAI/pull/1675)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1678](https://togithub.com/mudler/LocalAI/pull/1678)
- Drop ggml-based gpt2 and starcoder (supported by llama.cpp) by
[@&#8203;mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1679](https://togithub.com/mudler/LocalAI/pull/1679)
- fix(Dockerfile): sycl dependencies by
[@&#8203;mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1686](https://togithub.com/mudler/LocalAI/pull/1686)
- feat: Use ubuntu as base for container images, drop deprecated
ggml-transformers backends by
[@&#8203;mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1689](https://togithub.com/mudler/LocalAI/pull/1689)

##### 👒 Dependencies

- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1656](https://togithub.com/mudler/LocalAI/pull/1656)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1665](https://togithub.com/mudler/LocalAI/pull/1665)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1669](https://togithub.com/mudler/LocalAI/pull/1669)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1673](https://togithub.com/mudler/LocalAI/pull/1673)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1683](https://togithub.com/mudler/LocalAI/pull/1683)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1688](https://togithub.com/mudler/LocalAI/pull/1688)
- ⬆️ Update mudler/go-stable-diffusion by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1674](https://togithub.com/mudler/LocalAI/pull/1674)

##### Other Changes

- ⬆️ Update docs version mudler/LocalAI by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1661](https://togithub.com/mudler/LocalAI/pull/1661)
- feat(mamba): Add bagel-dpo-2.8b by
[@&#8203;richiejp](https://togithub.com/richiejp) in
[https://github.com/mudler/LocalAI/pull/1671](https://togithub.com/mudler/LocalAI/pull/1671)
- fix (docs): fixed broken links `github/` -> `github.com/` by
[@&#8203;Wansmer](https://togithub.com/Wansmer) in
[https://github.com/mudler/LocalAI/pull/1672](https://togithub.com/mudler/LocalAI/pull/1672)
- Fix HTTP links in README.md by
[@&#8203;vfiftyfive](https://togithub.com/vfiftyfive) in
[https://github.com/mudler/LocalAI/pull/1677](https://togithub.com/mudler/LocalAI/pull/1677)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1681](https://togithub.com/mudler/LocalAI/pull/1681)
- ci: cleanup worker before run by
[@&#8203;mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1685](https://togithub.com/mudler/LocalAI/pull/1685)
- Revert "fix(Dockerfile): sycl dependencies" by
[@&#8203;mudler](https://togithub.com/mudler) in
[https://github.com/mudler/LocalAI/pull/1687](https://togithub.com/mudler/LocalAI/pull/1687)
- ⬆️ Update ggerganov/llama.cpp by
[@&#8203;localai-bot](https://togithub.com/localai-bot) in
[https://github.com/mudler/LocalAI/pull/1691](https://togithub.com/mudler/LocalAI/pull/1691)

##### New Contributors

- [@&#8203;richiejp](https://togithub.com/richiejp) made their first
contribution in
[https://github.com/mudler/LocalAI/pull/1671](https://togithub.com/mudler/LocalAI/pull/1671)
- [@&#8203;Wansmer](https://togithub.com/Wansmer) made their first
contribution in
[https://github.com/mudler/LocalAI/pull/1672](https://togithub.com/mudler/LocalAI/pull/1672)
- [@&#8203;vfiftyfive](https://togithub.com/vfiftyfive) made their first
contribution in
[https://github.com/mudler/LocalAI/pull/1677](https://togithub.com/mudler/LocalAI/pull/1677)

**Full Changelog**:
mudler/LocalAI@v2.7.0...v2.8.0

</details>

---

### Configuration

📅 **Schedule**: Branch creation - "before 10pm on monday" in timezone
Europe/Amsterdam, Automerge - At any time (no schedule defined).

🚦 **Automerge**: Enabled.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the
rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about these
updates again.

---

- [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check
this box

---

This PR has been generated by [Renovate
Bot](https://togithub.com/renovatebot/renovate).

<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiIzNy4xODMuMCIsInVwZGF0ZWRJblZlciI6IjM3LjE4My4wIiwidGFyZ2V0QnJhbmNoIjoibWFzdGVyIn0=-->
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
1 participant