Update learning rate schedule in training/default.yaml #58

Open. Wants to merge 1 commit into develop.
Conversation

jswijnands commented

This updates the default values for the learning rate schedule. Specifically, the local learning rate is increased to 8x the original learning rate. In addition, the number of iterations is reduced from 300k to 150k. This has led to similar accuracy in stretched grid experiments using the graph transformer architecture, while requiring only 50% of the GPU compute. Optimised default values could save GPU resources across the Pilot Project. An illustrative config sketch follows the review questions below.

Suggested review questions:

  • Are these settings also an enhancement for global / LAM models? (tested only for stretched grid)
  • Are these settings also an enhancement for GNN / transformer models? (tested only for graph transformer architecture)
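
To make the described change concrete, the sketch below shows what such a schedule update could look like in YAML. This is a minimal illustration, not the actual diff: the key names (`lr.rate`, `lr.iterations`) and the base learning rate value are assumptions, since the contents of training/default.yaml are not quoted in this conversation; only the 8x increase and the 300k-to-150k reduction come from the PR description.

```yaml
# Illustrative sketch only. Key names and base values are assumptions;
# the PR itself only specifies an 8x higher local learning rate and a
# schedule shortened from 300k to 150k iterations.
lr:
  # Local (per-device) learning rate, raised to 8x the previous default,
  # e.g. 1.0e-4 -> 8.0e-4 if the previous default were 1.0e-4 (hypothetical).
  rate: 8.0e-4
  # Total scheduler length, halved from 300000 to 150000 iterations.
  iterations: 150000
```

Intuitively, a higher peak learning rate can compensate for a shorter schedule, which would be consistent with the reported similar accuracy at half the GPU compute, though whether this holds beyond the stretched grid setup is exactly what the review questions above ask.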


FussyDuck commented Sep 17, 2024

CLA assistant check
All committers have signed the CLA.

JesperDramsch (Member) commented

Hi @jswijnands, thank you for the contribution.

We'll look into benchmarking this on the non-stretched grid models and evaluate whether we should make this a special config, as I believe the stretched grid may generally benefit from different training defaults.
