-
Notifications
You must be signed in to change notification settings - Fork 730
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add AMD/ROCm support for SSD TBE inference
cla signed
fb-exported
meta-exported
module: rocm
#5561
opened Mar 31, 2026 by
goldcoderZ
Loading…
Add TurboSSDInferenceModule for HSTU serving integration
cla signed
fb-exported
meta-exported
#5560
opened Mar 31, 2026 by
goldcoderZ
Loading…
Add AMD/ROCm support for SSD TBE inference
cla signed
fb-exported
meta-exported
module: rocm
#5559
opened Mar 31, 2026 by
goldcoderZ
Loading…
Add TurboSSDInferenceModule for HSTU serving integration
cla signed
fb-exported
meta-exported
#5558
opened Mar 31, 2026 by
goldcoderZ
Loading…
2D weights support for permute_1D_data_kernel_vec
cla signed
fb-exported
meta-exported
#5557
opened Mar 31, 2026 by
kausv
Loading…
Remove pt2_cpu stubs and move isValidBlockingFactor
cla signed
#5556
opened Mar 31, 2026 by
cyyever
Loading…
Remove omp_set_num_threads from RadixSortTest to fix ASan leak
cla signed
#5555
opened Mar 31, 2026 by
cyyever
Loading…
Add streaming_update() and load_snapshot() for inference (#5554)
cla signed
fb-exported
meta-exported
#5554
opened Mar 31, 2026 by
goldcoderZ
Loading…
Bump minimum GCC to 11.4 (#5537)
cla signed
fb-exported
meta-exported
#5553
opened Mar 31, 2026 by
q10
Loading…
Add embedding cache support to oneflow base model
cla signed
fb-exported
meta-exported
#5552
opened Mar 30, 2026 by
EddyLXJ
Loading…
Add failure logging and alerting for SSD offloading
cla signed
fb-exported
meta-exported
#5542
opened Mar 26, 2026 by
Frederick-Zhu
Loading…
Move internal enrichment files to fb/ for OSS exclusion
cla signed
fb-exported
meta-exported
#5541
opened Mar 26, 2026 by
EddyLXJ
Loading…
Fix DramKV race: hold rlock during inplace update writes
cla signed
#5536
opened Mar 26, 2026 by
cyyever
Loading…
Add meta function for block_bucketize_sparse_features_inference (#5529)
cla signed
fb-exported
meta-exported
#5529
opened Mar 25, 2026 by
georgiaphillips
Loading…
Precompute writeback dedup indices in forward to eliminate GPU-CPU sync in backward (#5522)
cla signed
fb-exported
meta-exported
#5522
opened Mar 24, 2026 by
Zhihan-Lu
Loading…
Add tests for group_index_select_dim0 mixed-dtype validation
cla signed
fb-exported
meta-exported
#5507
opened Mar 21, 2026 by
q10
Loading…
Fix Half2 UVM performance regression with vectorized store
cla signed
fb-exported
meta-exported
module: rocm
#5499
opened Mar 19, 2026 by
q10
Loading…
Use atomicAdd for lxu_cache_locking_counter increments/decrements
cla signed
fb-exported
meta-exported
#5479
opened Mar 16, 2026 by
goldcoderZ
Loading…
Implement pre-sorting, caching and contigous warp processing in group_index_select backward
cla signed
module: rocm
#5476
opened Mar 12, 2026 by
avbokovoy
Loading…
Deduplicate check to reduce binary size
cla signed
fb-exported
meta-exported
#5474
opened Mar 12, 2026 by
spcyppt
Loading…
Folly header clean up (fbgemm)
cla signed
fb-exported
meta-exported
#5471
opened Mar 10, 2026 by
mzlee
Loading…
Extend
permute_2D_sparse_data with optional pre-allocated output buffers
cla signed
fb-exported
meta-exported
#5461
opened Mar 9, 2026 by
TroyGarden
Loading…
Clean up kernel code by deleting unused options and code logic
cla signed
fb-exported
meta-exported
#5456
opened Mar 6, 2026 by
howei
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.