Releases · macrocosm-os/pretraining

14 Feb 14:11

cryptal-mc

v5.1.0

b163027

Release v5.1.0 Latest

Latest

Announcing release v5.1.0

Here are the main changes:

Changes

Updated bittensor dependency to 9.0.0 to support the dTao release.
Cleaned out the 14B* (datamix experiment) code.
reduced subset loaders request ttl to 15seconds.

NOTES TO VALIDATORS

-Please also make sure to rerun pip install to ensure updated dependencies.
python -m pip install -e

Assets 2

16 Jan 19:11

cryptal-mc

v5.0.0

346e17f

Release v5.0.0

Announcing release v5.0.0

Here are the main changes:

Changes

1. Native support for multi-dataset evaluation
Immediate effect

Implemented support for multi-dataset evaluation for all competitions using EvalTasks from taoverse 1.3.7

2. Updating the data mix
Activation block: 4_732_978

Both 3B and 14B will now be evaluated on the following mix:

HuggingFaceFW/fineweb (30%)
HuggingFaceFW/fineweb-edu-score-2 (25%)
bigcode/the-stack-v2-dedup (35%)
laion/pes2ox-fulltext (5%)
HuggingFaceTB/FineMath:finemath-3plus (3%)
HuggingFaceTB/FineMath:infiwebmath-3plus (2%)

Notice that the 14B* will be retired and replaced by the multi-dataset 14B with the above mix.

3. New epsilon lower bounds and decay intervals
Activation block 4_732_978

The epsilon decay interval and bounds will be updated for all competitions as follows:

3B competition:
Updated: decays from 0.005 to 0.0002 over 4 days
Updated: decays from 0.005 to 0.0005 over 7 days
14B competitions:
Updated: decays from 0.005 to 0.0002 over 5 days
Updated: decays from 0.005 to 0.0005 over 10 days

4. Updated emission distribution for competitions
Activation block 4_732_978

3B → 30%
14B → 70%

Other updates

Improved weight setting by spinning out separate threads that use different subtensors.
Switched from bt.logging to taoverse.logging.

NOTES TO VALIDATORS

-Please also make sure to rerun pip install to ensure updated dependencies.
python -m pip install -e

Assets 2

19 Dec 15:29

cryptal-mc

v4.6.4

28b37f6

Release v4.6.4

Announcing release v4.6.4

Here are the main changes:

Changes

Bittensor dependency version was bumped to 8.5.1.

The only action needed for validators is to pull this new release and reinstall dependencies.

NOTES TO VALIDATORS

-Please also make sure to rerun pip install to ensure updated dependencies.
python -m pip install -e

Assets 2

12 Dec 02:27

cryptal-mc

v4.6.3

75356e4

Release v4.6.3

Announcing release v4.6.3

Here are the main changes:

Changes

1. Bittensor version bumped to 8.4.3

We have noticed that we have a higher success rate for weight setting with this version than 6.9.4.

2. Changed S3 bucket URL for the stack v2-dedup dataset

The official doc provides two URLs to access the softwareheritage S3 bucket.
https://docs.softwareheritage.org/user/using_data/index.html#contents-on-s3

The URL we used before seem to fail in some region. The other URL https://softwareheritage.s3.amazonaws.com/content/<sha1> seem to be more robust, so we updated the dataloader to use it.

NOTES TO VALIDATORS

-IMPORTANT: The newly added dataset for code the-stack-v2-dedup requires a Hugging Face access token and S3 secret and access keys. You can learn how to obtain and configure those tokens in our validator documentation here.

-Please also make sure to rerun pip install to ensure updated dependencies.
python -m pip install -e

Assets 2

03 Dec 17:03

cryptal-mc

v4.6.2

bc90394

Release v4.6.2

Announcing release v4.6.2

Here are the main changes:

Changes

1. Increasing code proportion in the data mix
Activation block: 4_453_709

We are replacing the-stack-dedup with the-stack-v2-dedup in the 14B-star competition and increasing the code proportion in the validation dataset from ~5% to ~15%.

Datasets used during evaluation for the 14B-star competition is as follows:

HuggingFaceFW/fineweb-edu-score-2 (85%)
bigcode/the-stack-v2-dedup (15%)

Datasets for the 3B and 14B competitions are left unchanged.

2. New epsilon lower bounds and decay intervals
Activation block 4_453_709

The epsilon decay interval and bounds will be updated for all competitions as follows:

3B competition:
Current: decays from 0.005 to 0.0005 over 7 days
Updated: decays from 0.005 to 0.0002 over 4 days
14B and 14Bstar competitions:
Current: decays from 0.005 to 0.0005 over 7 days
Updated: decays from 0.005 to 0.0002 over 5 days

3. Updated emission distribution for competitions
Activation block 4_453_709

3B → 20%
14B → 40%
14B-star → 40%

4. Fixed all package version in requirements.txt
To avoid any installation issues and package compatibilities, we have fixed all dependency package versiosn in the requirements.txt file. The installation experience should be smoother now.

NOTES TO VALIDATORS

-Please also make sure to rerun pip install to ensure updated dependencies.
python -m pip install -e

Assets 2

13 Nov 18:07

Sid-Data-Universe

v4.6.1

49138f9

Release 4.6.1

This release fixes a bug where models that are winning in 14B* but are non-competitive in 14B would not be kept for evaluation.

Assets 2

08 Nov 19:19

cryptal-mc

v4.6.0

69458c0

Release v4.6.0

Announcing release v4.6.0

Here are the main changes:

Changes

1. 14B Multi-Dataset Competition.
Activation block: 4_252_646

Datasets used during evaluation:

HuggingFaceFW/fineweb-edu-score-2 (95%)
bigcode/the-stack-dedup (5%)

2. Retiring the 700M competition
Activation block: 4_252_646

We will sunset the 700M competition at the same time that the new 14B rises. The 14B multi-dataset competition will be taking in the emissions of 700M.

3. New epsilon lower bound and decay interval
Activation block 4_252_646

We will slightly increase the lower bound for epsilon from 0.0001 to 0.0005 and decrease the decay interval from 10 to 7 days.

4. Evaluation data syncing
Effected immediately.

Validation batches will now be synced across all validators. We will also introduce a delay in picking up models for validation to prevent exploits by training on the exact upcoming batch.

5. Deduplicating evaluation data
Effected immediately.

This issue has been raised on Discord. Picking a random offset when sampling batches could result in two validation pages with some overlap when the offset difference is less than the number of samples pulled at each offset position. Although this is a very rare issue when dealing with large datasets, this issue has been fixed now only on FineWeb-Edu2 but it will be generalized to all loaders in a following releases.

6. Setting repo visibility at model upload
Effected immediately.

Added the --update_repo_visibility argument to the upload_mode.py to enable changing the HuggingFace repo at model upload rather than setting this manually.

7. New emission distribution
3B → 29%
14B → 57%
14B Multi-Dataset → 14%

NOTES TO VALIDATORS

-The newly added dataset for code The Stack V1-dedup requires an Hugging Face access token. You can learn how to obtain one in our validator documentation here.

-Please also make sure to rerun pip install to ensure updated dependencies.
python -m pip install -e

Assets 2

15 Oct 11:50

cryptal-mc

v4.5.3

dd068a2

Release v4.5.3

Announcing release v4.5.3

Here are the main changes:

Changes

This fix first filters out all models which can't beat all earlier models at full epsilon decay, before computing win rates (and incentive weights). This will ensure that:
- All clones are filtered out before we compute win rates so clones can no longer provide an artificially high win rate.
- Any model that is not better than the fully-decayed top model but submitted at a later block is discarded from the evaluation process.
Validator version bumped to 3.4.0

Assets 2

11 Oct 09:41

cryptal-mc

v4.5.2

0637ee3

Release v4.5.2

Announcing release v4.5.2

Here are the main changes:

Changes

Bumped validator version to force all validators to re-evaluate models that had been excluded due to 'inf' loss caused by the low compute loss TTL in 4.5.0.
Decay period for 14B has been temporarily extended from 7 to 10 days, so models that had been excluded can have a second chance to get a fair ranking.

Assets 2

10 Oct 19:24

cryptal-mc

v4.5.1

8468a04

Release v4.5.1

Announcing release v4.5.1

Here are the main changes:

Changes

Updated to the latest taoverse version v1.0.6 which correctly gets the newest instead of the oldest filetime in a directory.
Bumped the grace period for downloading models to 10 minutes. this should provide additional buffer for those with slower connections when downloading 14B models.
Reduced the number of validation pages to 11 instead of 18.
Increasing compute loss TTL to 430s instead of 400s (Although all current 14B models complete validation under 400s seconds).
Reduced epsilon decay period to 1 week instead of two for 14B models.

Note to validators

Please also make sure to rerun pip install to ensure updated dependencies.

python -m pip install -e .

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Announcing release v5.1.0

Changes

NOTES TO VALIDATORS

Announcing release v5.0.0

Changes

NOTES TO VALIDATORS

Announcing release v4.6.4

Changes

NOTES TO VALIDATORS

Announcing release v4.6.3

Changes

NOTES TO VALIDATORS

Announcing release v4.6.2

Changes

NOTES TO VALIDATORS

Announcing release v4.6.0

Changes

NOTES TO VALIDATORS

Announcing release v4.5.3

Changes

Announcing release v4.5.2

Changes

Announcing release v4.5.1

Changes

Note to validators

Releases: macrocosm-os/pretraining

Release v5.1.0

Announcing release v5.1.0

Changes

NOTES TO VALIDATORS

Release v5.0.0

Announcing release v5.0.0

Changes

NOTES TO VALIDATORS

Release v4.6.4

Announcing release v4.6.4

Changes

NOTES TO VALIDATORS

Release v4.6.3

Announcing release v4.6.3

Changes

NOTES TO VALIDATORS

Release v4.6.2

Announcing release v4.6.2

Changes

NOTES TO VALIDATORS

Release 4.6.1

Release v4.6.0

Announcing release v4.6.0

Changes

NOTES TO VALIDATORS

Release v4.5.3

Announcing release v4.5.3

Changes

Release v4.5.2

Announcing release v4.5.2

Changes

Release v4.5.1

Announcing release v4.5.1

Changes

Note to validators