Free Colab #12

rarhs · 2024-05-22T08:11:00Z

Can't run on free colab due to not having adequate RAM.

AbnerAI · 2024-05-22T09:27:08Z

How much CPU/GPU resoures are required?

wdndev · 2024-05-25T07:36:26Z

Hello, I have an idea.

Because the model is loaded by the CPU, I am using my notebook (16G RAM) running and cannot load the Llama3-8B model.

So, I took the first two Transformers layers from the 32-layers architecture of the Llama3-8B model to form a new model. This can be run in a notebook with 16G RAM, occupying about 4~5G RAM, but the final decoding result is wrong, and the middle result is all correct.

You can try it with this model.

Haggingface link: https://huggingface.co/wdndev/Meta-Llama-3-8B-Instruct-2layers
ModeScope link: https://www.modelscope.cn/models/wdndev/Meta-Llama-3-8B-Instruct-2layers

and the colab link, it can run directly:

llama3-from-scratch-en: https://colab.research.google.com/drive/1X9yEa4hAZzgrwTuxHValBoVt1qfx6AXv?usp=sharing
llama3-from-scratch-zh: https://colab.research.google.com/drive/11MQb8Bn4Ck707VEcqqGVdytqOk3OrQQK?usp=sharing

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Free Colab #12

Free Colab #12

rarhs commented May 22, 2024

AbnerAI commented May 22, 2024

wdndev commented May 25, 2024 •

edited

Loading

Free Colab #12

Free Colab #12

Comments

rarhs commented May 22, 2024

AbnerAI commented May 22, 2024

wdndev commented May 25, 2024 • edited Loading

wdndev commented May 25, 2024 •

edited

Loading