Update demo video in README.md
hodlen committed Dec 16, 2023
1 parent 64d83e1 commit 7b699b0
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
@@ -3,9 +3,9 @@

*Demo* 🔥

-https://github.com/hodlen/PowerInfer/assets/34213478/b782ccc8-0a2a-42b6-a6aa-07b2224a66f7
+https://github.com/SJTU-IPADS/PowerInfer/assets/34213478/d26ae05b-d0cf-40b6-8788-bda3fe447e28

-<sub>The demo is running with a single 24G 4090 GPU, the model is Falcon (ReLU)-40B, and the precision is FP16.</sub>
+<sub>PowerInfer v.s. llama.cpp on a single RTX 4090(24G) running Falcon(ReLU)-40B-FP16 with a 11x speedup!</sub>

---
## Abstract