Merge with upstream #48

Open
wants to merge 2,867 commits into
base: main

Conversation

Quentin-Anthony
Collaborator

No description provided.

@Quentin-Anthony Quentin-Anthony self-assigned this Jan 15, 2024
mathemakitten and others added 29 commits December 20, 2024 21:51
…PE tests to use fusion

Co-authored-by: Mcore Bot <[email protected]>
Co-authored-by: Xin Yao <[email protected]>
Correct strides for bshd layout and revert RoPE tests to use fusion

See merge request ADLR/megatron-lm!2469
Add model checkpoint links

See merge request ADLR/megatron-lm!2494
…ks that don't have trainable params

Co-authored-by: Jon Barker <[email protected]>
Support --freeze-LM and --freeze-ViT with ranks that don't have trainable params

See merge request ADLR/megatron-lm!2285
Move mmodal evaluation code to its own folder

See merge request ADLR/megatron-lm!2491
Updating T5 codes to fix bugs

See merge request ADLR/megatron-lm!2471
ci: Add memory consumption to tests

See merge request ADLR/megatron-lm!2467
Reuse optimizer's main_params to compute param norm in a memory-efficient way

See merge request ADLR/megatron-lm!2483
Add NeMo MoE test.

See merge request ADLR/megatron-lm!2460
ci: Move most of LTS tests to nightly

See merge request ADLR/megatron-lm!2496
Video training

See merge request ADLR/megatron-lm!2500
ci: Update golden values of nightlies

See merge request ADLR/megatron-lm!2511
Make generate function only return results for newly added requests

See merge request ADLR/megatron-lm!2370
ci: Use torchrun

See merge request ADLR/megatron-lm!2507
chore: Fix local generator script

See merge request ADLR/megatron-lm!2519
Co-authored-by: William Dykas <[email protected]>
Co-authored-by: Mcore Bot <[email protected]>
Co-authored-by: William Dykas <[email protected]>
Co-authored-by: root <[email protected]>
ananthsub and others added 30 commits February 11, 2025 11:42
[dist ckpt] Remove alias LocalNonpersitentObject

See merge request ADLR/megatron-lm!2458
[dist ckpt] Resolve todos in `_split_by_size_and_type` during saving

See merge request ADLR/megatron-lm!2623
Embedder fix for radio

See merge request ADLR/megatron-lm!2651
tests: test_builder

See merge request ADLR/megatron-lm!2670
Llava unit test fix

See merge request ADLR/megatron-lm!2669
Warn instead of error when model_opt is enabled with local checkpointing.

See merge request ADLR/megatron-lm!2667
Reduce NCCL memory cost in UT

See merge request ADLR/megatron-lm!2633
Enabling UCC backend for PP communication

See merge request ADLR/megatron-lm!2116
Fix for Frozen QK LayerNorm when training VLM with freeze VIT

See merge request ADLR/megatron-lm!2632
chore: Bump version

See merge request ADLR/megatron-lm!2680
Basic context and sequence parallel support in VLM example

See merge request ADLR/megatron-lm!2561
Co-authored-by: jianbinc <[email protected]>
Co-authored-by: 顾慎 <[email protected]>
Co-authored-by: 李鹏 <[email protected]>
Co-authored-by: 黄俊 <[email protected]>
Co-authored-by: lostkevin <[email protected]>
Co-authored-by: lostkevin <[email protected]>
Co-authored-by: root <[email protected]>
Optimizer CPU offload support

See merge request ADLR/megatron-lm!2526
Fix distributed checkpoint tests

See merge request ADLR/megatron-lm!2675
Co-authored-by: Slawomir Kierat <[email protected]>
Co-authored-by: Helen Ngo <[email protected]>
Co-authored-by: Mcore Bot <[email protected]>
Statically allocate KV cache for MCore inference

See merge request ADLR/megatron-lm!2641
Co-authored-by: Deepak Narayanan <[email protected]>
Co-authored-by: Cyril Meurillon <[email protected]>
Various improvements to RerunStateMachine

See merge request ADLR/megatron-lm!2659