-
Notifications
You must be signed in to change notification settings - Fork 89
Pull requests: sgl-project/SpecForge
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
delete last message when regenerating train data
#248
opened Oct 8, 2025 by
Shang-Pin
Loading…
6 tasks done
fix: remove attention mask shift in flex attention implementation
#245
opened Oct 4, 2025 by
jialefu
Loading…
6 tasks
fix: remove attention mask shift & add pe shift
#244
opened Oct 3, 2025 by
Liyuhui-12
Loading…
6 tasks
Fix resume offline train logic. Add loading optimizer state
#243
opened Sep 29, 2025 by
hanq-moreh
Loading…
6 tasks
fix: mid aux hidden layer id calculation in online mode
#240
opened Sep 24, 2025 by
Liu-Xue-Song
Loading…
6 tasks
support deepseek-v2-lite online train and support yarn rope
#224
opened Sep 8, 2025 by
jiapingW
Loading…
6 tasks
Fix for when draft model hidden dimension is different from target model hidden dimension
#183
opened Aug 26, 2025 by
yilian49
Loading…
Feat: Support TP for long-context draft model training
high priority
#117
opened Aug 6, 2025 by
yd-oom
Loading…
6 tasks
ProTip!
no:milestone will show everything without a milestone.