From 9a34d633d001e50eb03760ee834c0c5a32ede80a Mon Sep 17 00:00:00 2001 From: IanNod <45800100+IanNod@users.noreply.github.com> Date: Tue, 3 Dec 2024 09:23:38 -0800 Subject: [PATCH] Update halo-models.md --- halo-models.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/halo-models.md b/halo-models.md index fa16939..6b8e060 100644 --- a/halo-models.md +++ b/halo-models.md @@ -34,7 +34,7 @@ See latest [CI/Nightly Test Report](https://nod-ai.github.io/shark-ai/). Use [No |------------------------------|-----------------------|--------------------------| | Sharktank Modeling |
- @Boian Add CLIP encoder to sharktank (ETA: 11/27)
-@Dan fix numeric fp8 issue (ETA: 11/26) | - @Boian CLIP encoder (ETA: 12/5)
- @Rob CI llama regression tests (ETA 12/3)
- @Ian Finish VAE decode (ETA: 12/5) | IREE codegeneration |- @kunvar support for non deocmposed decode (ETA: 11/27)
- @stan: FP8 attention (ETA: 11/27) | -| Serving |
- @stephen / @xida implement Radix Attention in shortfin (ETA: 12/6)
- @egarvey wire up Flux.1 in shortfin using ONNX model (ETA:11/27) | - @eagarvey finish Flux pipeline for image generation (ETA: 12/2) | - @Stephen Debug CI flakiness (ETA: 12/2)
- @Xida landing PR's for attention changes (ETA: 12/3) +| Serving |
- @stephen / @xida implement Radix Attention in shortfin (ETA: 12/6)
- @egarvey wire up Flux.1 in shortfin using ONNX model (ETA:11/27) |
- @eagarvey finish Flux pipeline for image generation (ETA: 12/3)
- @Stephen Debug CI flakiness (ETA: 12/2)
- @Xida landing PR's for attention changes (ETA: 12/3) | Test Automation |
- @Avi Work with codegen folks to get 405B FP16 fixed and tested (ETA: 11/18) | - @Avi benchmarking dashboard (ETA: 12/3)
- @Archana shortfin regression tests (ETA: 12/3) | Performance Tuning | |