
Add support for Llama3.3 #2661

Open
jorgeantonio21 opened this issue Dec 6, 2024 · 4 comments
@jorgeantonio21
Contributor

jorgeantonio21 commented Dec 6, 2024

https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct

@zackangelo
Contributor

There were no architectural changes, AFAIK; it should already work with the existing example if you just change the repo URL.
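
For anyone who wants to try it, here's a minimal sketch of pointing at the new repo (assuming the `hf-hub` crate, which the candle examples already use; note the `meta-llama` repos are gated, so an HF token must be configured):

```rust
// Sketch only: fetch the Llama 3.3 config/tokenizer with the hf-hub crate.
// Cargo deps assumed: hf-hub and anyhow.
use hf_hub::api::sync::Api;

fn main() -> anyhow::Result<()> {
    let api = Api::new()?;
    // Only the repo id changes relative to Llama 3.1; the architecture is the same.
    let repo = api.model("meta-llama/Llama-3.3-70B-Instruct".to_string());
    let config = repo.get("config.json")?;
    let tokenizer = repo.get("tokenizer.json")?;
    println!("config at {config:?}, tokenizer at {tokenizer:?}");
    Ok(())
}
```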

@theHausdorffMetric

I added the corresponding repo URL to the llama example and opened a pull request:
#2677

@LaurentMazare
Collaborator

Were you able to try it out? I don't think it would fit in the memory of a single GPU, so this would be better suited to the llama_multiprocess example, which can use multiple GPUs. It would be great if someone could give it a spin there and check that it works.
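
For context, some back-of-the-envelope numbers (mine, not from the thread): 70B parameters at bf16 is roughly 70e9 × 2 bytes ≈ 140 GB for the weights alone, before the KV cache, so it exceeds even a single 80 GB A100/H100 and has to be sharded across GPUs.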

@zackangelo
Contributor

Ah, you're right @LaurentMazare.

I don't believe the multiprocess example has been updated to create the new RoPE scaling tensors that were introduced in Llama 3.1, but it should work otherwise.
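
For reference, this is the scaling in question, as a self-contained sketch (constants from the published Llama 3.1 config; the function names here are illustrative, not candle's actual API):

```rust
use std::f32::consts::PI;

// Llama 3.1 rope_scaling config values.
const FACTOR: f32 = 8.0;
const LOW_FREQ_FACTOR: f32 = 1.0;
const HIGH_FREQ_FACTOR: f32 = 4.0;
const OLD_CONTEXT_LEN: f32 = 8192.0;

/// Rescale the base inverse frequencies the way Llama 3.1 does:
/// low-frequency components are divided by FACTOR, high-frequency ones
/// are kept as-is, and the band in between is linearly interpolated.
fn scale_inv_freqs(inv_freqs: &[f32]) -> Vec<f32> {
    let low_freq_wavelen = OLD_CONTEXT_LEN / LOW_FREQ_FACTOR;
    let high_freq_wavelen = OLD_CONTEXT_LEN / HIGH_FREQ_FACTOR;
    inv_freqs
        .iter()
        .map(|&freq| {
            let wavelen = 2.0 * PI / freq;
            if wavelen < high_freq_wavelen {
                freq // high-frequency band: unchanged
            } else if wavelen > low_freq_wavelen {
                freq / FACTOR // low-frequency band: stretched
            } else {
                // smooth interpolation between the two regimes
                let smooth = (OLD_CONTEXT_LEN / wavelen - LOW_FREQ_FACTOR)
                    / (HIGH_FREQ_FACTOR - LOW_FREQ_FACTOR);
                (1.0 - smooth) * freq / FACTOR + smooth * freq
            }
        })
        .collect()
}

fn main() {
    // Base inverse frequencies for head_dim = 128 and rope_theta = 500_000,
    // as in the Llama 3.x configs.
    let (head_dim, theta) = (128usize, 500_000.0f32);
    let base: Vec<f32> = (0..head_dim / 2)
        .map(|i| 1.0 / theta.powf(2.0 * i as f32 / head_dim as f32))
        .collect();
    println!("first scaled inv_freqs: {:?}", &scale_inv_freqs(&base)[..4]);
}
```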
