-
Notifications
You must be signed in to change notification settings - Fork 314
Pull requests: AI-Hypercomputer/maxtext
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Use causal mask type for training or prefill with fixed seqlen when
#1248
opened Feb 7, 2025 by
wenchenvincent
•
Draft
3 of 4 tasks
Enable mlperf to use mixtral-v1 tokenized dataset to avoid dropping in total_weights when splitting data across multiple host
#1246
opened Feb 7, 2025 by
aireenmei
Loading…
4 tasks done
[wip-prototype]chunked prefill on 1k prompt len
#1237
opened Feb 4, 2025 by
mailvijayasingh
Loading…
4 tasks
Make float32_qk_product and float32_logits apply during inference
#1225
opened Feb 1, 2025 by
philip-essential
Loading…
4 tasks done
Add Pathways Benchmarking Recipes for Scale Testing (and fix bugs)
#1220
opened Jan 31, 2025 by
SujeethJinesh
•
Draft
4 tasks
[DRAFT] Initial commit Maxtext unit tests with Pathways backend.
#1211
opened Jan 28, 2025 by
RoshaniN
Loading…
4 tasks
Run Maxtext unit test on candidate JStS Image
#1204
opened Jan 27, 2025 by
parambole
Loading…
4 tasks done
[RFC]Prototype the MPMD GPipe-circular with single-host H100=8.
#1202
opened Jan 27, 2025 by
lc5211
Loading…
1 task
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-01-07.