A descriptive title for your project goes here.
This project focuses on developing and implementing machine learning models using PyTorch. It includes various Jupyter Notebooks demonstrating different aspects of model training, tokenization, and inference. Key components include:
train.ipynb
: Training models using PyTorch.inference.ipynb
: Running inference with trained models.mha_GPT.ipynb
andmy_GPT.ipynb
: Implementations of custom GPT models.train_mha.ipynb
,train_mha_avec_mp.ipynb
, andtrain_orca.ipynb
: Advanced training techniques with multi-head attention and other optimizations.dataset_bes.ipynb
: Notebook for handling and preparing the dataset.tinystories_tokeniser.model
andtinystories_tokeniser.vocab
: Tokenizer files for processing text data.
To run the notebooks, ensure you have Jupyter and PyTorch installed. You can install these using pip:
pip install torch torchvision torchaudio
pip install jupyterlab
- Open the desired .ipynb file in Jupyter Lab or Notebook.
- Follow the instructions within each notebook.
Contributions to this project are welcome. Please fork the repository and submit a pull request with your changes.
This project is licensed under the MIT License - see the LICENSE.md file for details.
Thanks to great Recurrent Rebels team