- tagging
- bump
- adds workflow for pretraining a super-network (#173)
- allow passing a Fabric instance (#162)
- `release.yml`: artifact upload for changelog (#211)
- `release.yml`: make `bump` dependent on `test-code` (#209)
- set `cos`/`sin` in `max_seq_len` (#203)
- update readme (#193)
- release workflow (#202)
- fix dependency versions to avoid breaking CI (#195)
- add Colab tutorial notebook and fix typos (#188)
- addition and standardization of docstrings (#176)
- Remove `rope_cache` from `max_seq_len` setter (#181)
- Revert "ci: reusing unit-test.yml in release.yml" (#187)
- fix release workflow clash with branch protection rules (#184)
- Revert "fix: reworking
release.yml
to avoid clashing with branch protection rules" (#182) - reworking
release.yml
to avoid clashing with branch protection rules (#180) - set input variables as required positional arguments (#172)
- update tokenizers in pyproject.toml (#168)
- type hints and docstrings for `mkdocs build --clean --strict` (#167)
- forcing DeepSpeed to use CPU for profiling FLOPs (#154)
- deprecate flexible MLP heads (#160)
- add support for `LLaMAMLP` in `extract_sub_network` (#147)
- adding FLOPs and MACs profiling for subnets (#145)
- add script to profile latency (#141)
- modify RoPE for Llama-3 and support Llama-3.2 (#131)
- add GPT tutorial notebook utils (#122)
- add installation instructions to the documentation (#121)
- refactor metric names (#152)
- Extract weights for norm layers and test with random initialization (#151)
- handle device in GPT model properly (#143)
- rename call function (#144)
- delete `supernet_configs` directory (#140)
- deprecate `sample_random_indices` (#133)
- support params and mag when sharing layer norm in Phi-2 (#127)
- reset random layers in `reset_super_network` (#126)
- support GQA param count (#124)
- update readme (#111)
- litgpt update (#95)
- adding commitizen (cz) config (#119)
- removing version parsing in `whittle/__init__.py` (#118)
- renaming `whittle/version` to `whittle/version.py` (#117)
- commitizen configuration (#115)
- remove deprecated module (#105)
- delete old code (#104)
- allow passing other loss functions to training strategies (#101)
- set random state properly in sampler (#103)