-
Notifications
You must be signed in to change notification settings - Fork 578
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix test breaking in internal github repo
cla signed
fb-exported
#4162
opened May 20, 2025 by
q10
Loading…
optimization of perKVhead quantization
cla signed
fb-exported
#4161
opened May 20, 2025 by
Aya-ZIbra
Loading…
iRoPE varseq flag for pre-calculated kv qparams
cla signed
fb-exported
#4160
opened May 20, 2025 by
Aya-ZIbra
Loading…
Add checks for dimensions of pooled_embs
cla signed
fb-exported
#4159
opened May 20, 2025 by
q10
Loading…
support filling partial rows from backend
cla signed
fb-exported
#4158
opened May 20, 2025 by
duduyi2013
Loading…
Leverage fuse kernel in inference workload (#1237)
cla signed
fb-exported
#4157
opened May 20, 2025 by
ycui1984
Loading…
Jemalloc Mempool and Adaptation for CPU HASHTABLE
cla signed
#4154
opened May 20, 2025 by
ArronHZG
Loading…
Add more parameter specializations for autovec TBE kernels
cla signed
fb-exported
#4153
opened May 20, 2025 by
excelle08
Loading…
improve read/write performance by 100%
cla signed
fb-exported
#4150
opened May 19, 2025 by
steven1327
Loading…
Make iter persistent for AdagradW
cla signed
fb-exported
#4147
opened May 17, 2025 by
minhua-chen
Loading…
support get state dict and apply state dict
cla signed
fb-exported
#4145
opened May 17, 2025 by
emlin
Loading…
implement optimizer state with opt offloading
cla signed
fb-exported
#4141
opened May 16, 2025 by
emlin
Loading…
Update heuristic for Cutlass BF16 Grouped GEMM
cla signed
fb-exported
#4138
opened May 16, 2025 by
cthi
Loading…
Simplify grouped gemm output allocations
cla signed
fb-exported
#4134
opened May 16, 2025 by
jwfromm
Loading…
Update the rowwise adagrad optimizer to leverage optimizer state offloading, v3
cla signed
fb-exported
#4133
opened May 15, 2025 by
q10
Loading…
Add TBE data configuration reporter to TBE forward"
cla signed
fb-exported
#4130
opened May 15, 2025 by
gchalump
Loading…
Refactor Cutlass BF16 Grouped GEMM
cla signed
fb-exported
#4124
opened May 14, 2025 by
cthi
Loading…
Trim constexpr from isA to improve Windows clang-cl support.
cla signed
#4119
opened May 13, 2025 by
ScottTodd
Loading…
Replace
C10_CUDA_KERNEL_LAUNCH_CHECK()
in the KernelLauncher
cla signed
fb-exported
#4097
opened May 8, 2025 by
q10
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.