v0.2.1
What's Changed
Full Changelog: v0.2.0...v0.2.1
New Features ✨
- Introduce a way to mix generated datasets before sending them to training by @shivchander @khaledsulayman @abhi1092 @aakankshaduggal @bbrowning @markmc in #163 #215
- Introduce data mixing recipe YAML files by @shivchander @khaledsulayman @abhi1092 @aakankshaduggal @bbrowning @markmc in #203
- Add 4 new pipeline blocks by @abhi1092 @shivchander @derekhiggins @markmc in #182
- Generate data for model evaluation using the MMLU benchmark by @shivchander @abhi1092 @aakankshaduggal @derekhiggins @markmc in #180 #212 #209 #193
Fixes 🐛
- Remove temporary e2e hack to use knowledge v3 PR by @markmc in #187
- Remove sys_prompt from contexts.yaml by @shivchander @derekhiggins in #189
- Move Block._validate to llmblock by @abhi1092 @derekhiggins in #191
- generate_data: introduce argument `client` to replace 6 others by @makelinux @tiran in #114
- Fix logging string formatting by @derekhiggins in #197
- Add utility function to convert from pandas DataFrame to Hugging Face dataset by @hickeyma in #199
- Update ConditionalLLMBlock's config_paths schema by @derekhiggins in #211
- Move system pipelines to /usr/share/instructlab/sdg/pipelines by @markmc in #214
New Contributors
- @bbrowning made their first contribution in #163
- @hickeyma made their first contribution in #199