Skip to content

Commit

Permalink
fix lorax logo
Browse files Browse the repository at this point in the history
  • Loading branch information
Michaelvll committed Mar 19, 2024
1 parent 079f942 commit d42b1eb
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion llm/lorax/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@
<!-- $UNCOMMENT# LoRAX: Multi-LoRA Inference Server -->

<p align="center">
<img src="https://github.com/predibase/lorax/blob/main/docs/images/lorax_guy.png" alt="LoRAX" style="width:200px;" />
<img src="https://imgur.com/OUapRYC.png" alt="LoRAX" style="width:200px;" />
</p>

[LoRAX](https://github.com/predibase/lorax) (LoRA eXchange) is a framework that allows users to serve thousands of fine-tuned LLMs on a single GPU, dramatically reducing the cost of serving without compromising on throughput or latency. It works by dynamically loading multiple fine-tuned "adapters" (LoRAs, etc.) on top of a single base model at runtime. Concurrent requests for different adapters can be processed together in a single batch, allowing LoRAX to maintain near linear throughput scaling as the number of adapters increases.
Expand Down

0 comments on commit d42b1eb

Please sign in to comment.