This is a PyTorch implementation of Hawk from [Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models](https://arxiv.org/abs/2402.19427). It uses a custom Triton kernel for the sequential scan and supports `torch.compile`. Because of this, it is the fastest GPU implementation of Hawk available.
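For intuition, the core operation the Triton kernel fuses is a first-order gated linear recurrence of the form `h_t = a_t * h_{t-1} + x_t` (with the gate computations folded into `a_t` and `x_t`). Below is a minimal pure-PyTorch sketch of that scan; the function name and tensor shapes are illustrative and are not part of this repo's API:

```python
import torch


def scan_reference(x: torch.Tensor, a: torch.Tensor) -> torch.Tensor:
    """Naive sequential scan h_t = a_t * h_{t-1} + x_t over the time axis.

    x, a: (batch, seq_len, recurrent_size). Purely illustrative: the Triton
    kernel replaces this Python loop with a single fused GPU pass.
    """
    h = torch.zeros_like(x[:, 0])
    outputs = []
    for t in range(x.shape[1]):
        h = a[:, t] * h + x[:, t]
        outputs.append(h)
    return torch.stack(outputs, dim=1)
```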
To install:

```bash
git clone https://github.com/fattorib/hawk-pytorch
cd hawk-pytorch
pip install -e .
```
Basic usage:

```python
import torch

from hawk import HawkModel, HawkConfig

config = HawkConfig(
    vocab_size=32000,
    hidden_size=512,
    intermediate_size=1024,
    recurrent_size=512,
    num_hidden_layers=8,
    num_blocks=16,
    post_norm=False,
)

model = HawkModel(config, use_cache=False)
model.to("cuda")
model = torch.compile(model)  # this works!

# dummy input token ids, used as both inputs and labels
x = torch.randint(size=(1, 2048), low=1, high=32000, device="cuda:0")

with torch.autocast(device_type="cuda", dtype=torch.bfloat16):
    loss = model(x, x)

loss.backward()
```
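Building on the snippet above, a minimal training-step sketch might look like the following; the optimizer choice, learning rate, and batch shape are placeholders, not defaults from this repo:

```python
# Hypothetical training loop; hyperparameters are illustrative only.
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

for step in range(100):
    batch = torch.randint(size=(1, 2048), low=1, high=32000, device="cuda:0")
    with torch.autocast(device_type="cuda", dtype=torch.bfloat16):
        # model returns the LM loss when called with (inputs, labels)
        loss = model(batch, batch)
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```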
If you use this code, please cite the Griffin paper:

```bibtex
@misc{de2024griffinmixinggatedlinear,
    title={Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models},
    author={Soham De and Samuel L. Smith and Anushan Fernando and Aleksandar Botev and George Cristian-Muraru and Albert Gu and Ruba Haroun and Leonard Berrada and Yutian Chen and Srivatsan Srinivasan and Guillaume Desjardins and Arnaud Doucet and David Budden and Yee Whye Teh and Razvan Pascanu and Nando De Freitas and Caglar Gulcehre},
    year={2024},
    eprint={2402.19427},
    archivePrefix={arXiv},
    primaryClass={cs.LG},
    url={https://arxiv.org/abs/2402.19427},
}
```
Code in `hawk/external.py` is taken from google-deepmind/recurrentgemma.