Pull requests: ServiceNow/Fast-LLM

Pull requests list

Vision dataset
#383 opened Oct 30, 2025 by jlamypoirier Draft
Fixes for loading Apriel checkpoint from HF format
#382 opened Oct 28, 2025 by bigximik
1 of 8 tasks
New memmap dataset format
#381 opened Oct 18, 2025 by jlamypoirier
Cleanup modeling file for Apriel-H
#379 opened Oct 16, 2025 by nitsanluke
Language model sample
#378 opened Oct 16, 2025 by jlamypoirier
Dataset interface
#377 opened Oct 15, 2025 by jlamypoirier
Add stochastic mixer for supernet training
#373 opened Oct 12, 2025 by tscholak
Nemotron-H mamba2
#355 opened Aug 21, 2025 by oleksost
2 of 26 tasks
[Dev] Hybrid dev branch
#347 opened Aug 7, 2025 by RaymondLi0
Fix loss masking
#345 opened Aug 6, 2025 by RaymondLi0 Draft
1 of 26 tasks
[WIP] Multimodal SSM + TP
#338 opened Jul 29, 2025 by RaymondLi0 Draft
25 tasks
WIP: Hybrid Multimodal
#332 opened Jul 21, 2025 by RaymondLi0 Draft
26 tasks
[work in progress] support for kv_cache
#322 opened Jul 4, 2025 by bigximik Draft
25 tasks
Masked Diffusion Training with Shift
#294 opened Jun 10, 2025 by nitsanluke Draft
1 of 25 tasks
[Prototype] Multimodal Audio
#272 opened May 15, 2025 by tobyzl2 Draft
25 tasks
[Prototype] Multimodal (vision) support
#227 opened Apr 8, 2025 by sohamparikh
8 tasks
[inactive] Track entropy and MI of routing distribution for topk MoE (label: enhancement)
#188 opened Mar 14, 2025 by oleksost Draft
9 of 22 tasks