Skip to content

Improving Gemma 2 for a Specific Language on a Budget: Post-Training Recipe ๐Ÿ’Ž๐ŸŒ๐Ÿ‡ฎ๐Ÿ‡น

License

Notifications You must be signed in to change notification settings

anakin87/gemma-neogenesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

22 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ’Ž๐ŸŒ๐Ÿ‡ฎ๐Ÿ‡น Gemma Neogenesis

Improving Gemma 2 for a Specific Language on a Budget: Post-Training Recipe

Additional resources for Gemma Neogenesis, a ๐Ÿ““ Kaggle notebook for improving Gemma 2 for a specific language on a budget. The notebook participates to the Kaggle competition: Google - Unlock Global Communication with Gemma.

Gemma Neogenesis

Notebook intro

The notebook demonstrates a case study on improving Gemma 2 2B's performance in Italian through Post-Training, combining Supervised Fine Tuning and Preference Tuning. The process uses both existing datasets and synthetic data generated specifically for this competition. While focused on Italian, the cost-effective methods demonstrated can inspire similar fine-tuning approaches for other languages.

๐Ÿ‘ฃ Navigating this repository

About

Improving Gemma 2 for a Specific Language on a Budget: Post-Training Recipe ๐Ÿ’Ž๐ŸŒ๐Ÿ‡ฎ๐Ÿ‡น

Resources

License

Stars

Watchers

Forks