v0.0.20: Multi-node training, SD Lora, sentence transformers clip, TGI improvements
What's Changed
Training
- Multi-node training support by @michaelbenayoun (#440)
TGI
- optimize continuous batching and improve export (#506)
Inference
- Add Lora support to stable diffusion by @JingyaHuang (#483)
- Support sentence transformers clip by @JingyaHuang (#495)
- Inference compile cache script by @philschmid and @dacorvo (#496, #504)
Doc
- Update Inference supported models list by @JingyaHuang (#501)
Bug fixes
- inference cache: omit irrelevant config parameters in lookup dy @dacorvo (#494)
- Optimize disk usage when fetching model checkpoints by @dacorvo (#505)
Full Changelog: v0.0.19...v0.0.20