Skip to content

Issues: pytorch/torchtitan

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

ZBVZeroBubble error bug Something isn't working
#774 opened Jan 3, 2025 by hhaAndroid
PP InterleavedZeroBubble schedule shows low TPS and high memory usage bug Something isn't working release_blocking Issues that are blocking the milestone / release completion
#773 opened Jan 3, 2025 by tianyu-l torchtitan v1.0.0 release
CP hangs when degree is more than 8 bug Something isn't working release_blocking Issues that are blocking the milestone / release completion
#772 opened Jan 3, 2025 by tianyu-l torchtitan v1.0.0 release
PP-related issues bug Something isn't working release_blocking Issues that are blocking the milestone / release completion
#771 opened Jan 3, 2025 by tianyu-l
2 of 6 tasks
torchtitan v1.0.0 release
HunyuanVideo Support enhancement New feature or request
#768 opened Jan 2, 2025 by bendanzzc
Can I load from non-FSDP optimizer state with FSDP2? question Further information is requested
#765 opened Dec 31, 2024 by syncdoth
FSDP 2 doesn't pad tensors? question Further information is requested
#764 opened Dec 29, 2024 by cassanof
Memory grows due to keeping losses on device better_engineering Repo code quality improvements good first issue Good for newcomers
#763 opened Dec 27, 2024 by carmocca
DeepSeek V3 Support enhancement New feature or request
#760 opened Dec 26, 2024 by casper-hansen
Checkpoint conversion question Further information is requested
#758 opened Dec 20, 2024 by MaxiBoether
Any plans to support DPO training? enhancement New feature or request
#756 opened Dec 20, 2024 by xs1997zju
JobConfig does not support typing enhancement New feature or request
#753 opened Dec 18, 2024 by greeneggsandyaml
Model init with HuggingFace model bug Something isn't working question Further information is requested
#743 opened Dec 16, 2024 by neeldani
Low bit Optimizers & FA-3 bug Something isn't working question Further information is requested
#742 opened Dec 16, 2024 by asahni04
Issue: Loss Discrepancy Between FSDP1 and FSDP2 with AdamW Optimizer question Further information is requested
#724 opened Dec 9, 2024 by Teng-xu
Context parallelism understanding context_parallel question Further information is requested
#723 opened Dec 9, 2024 by jinsong-mao
First Shard Group Save and Load Checkpoint for HSDP question Further information is requested
#709 opened Nov 29, 2024 by qsh-zh
[rfc] torchtitan release practices release_blocking Issues that are blocking the milestone / release completion
#688 opened Nov 22, 2024 by tianyu-l torchtitan v1.0.0 release
[Parallelism] Implement vocabulary parallelism enhancement New feature or request
#680 opened Nov 15, 2024 by casper-hansen
ProTip! Updated in the last three days: updated:>2025-01-04.