Pull requests: NVIDIA/cutlass

Pull requests list

Fix Thor MLA decode arch dispatch
#3173 opened Apr 17, 2026 by iloveai8086
WIP: OSS CI Testing for v4.5
#3171 opened Apr 16, 2026 by zekunf-nv (Collaborator)
[CuTeDSL][fix]: 1d bias epilogue fix
#3157 opened Apr 9, 2026 by leevan
Fix incorrect example paths in CuTeDSL docstrings
#3151 opened Apr 6, 2026 by Weili-0234
[Hopper CuTeDSL] Add FP8 GEMM with 2xAcc
#3149 opened Apr 5, 2026 by Johnsonms (Contributor)
Fix Hopper FMHA performance regression on CUDA < 13.1
#3137 opened Mar 31, 2026 by arvin-chou
5 of 6 tasks
feat(CuTeDSL): print benchmark time from Blackwell dense_gemm CLI
#3136 opened Mar 30, 2026 by aidando73 (Contributor)
Fix elementwise_apply.py
#3129 opened Mar 25, 2026 by HydraQYH (Contributor)
[CuTeDSL] Add SM103 grouped block-scaled GEMM kernel and tests
#3124 opened Mar 23, 2026 by Johnsonms (Contributor)
Enable strict C++ compiler warnings with -Werror
#3123 opened Mar 22, 2026 by maxwbuckley
3 of 4 tasks
[bugfix] use acquire to prevent reordering.
#3118 opened Mar 20, 2026 by shubaoyu2 (Contributor)
Fix typo in elementwise_add.py
#3116 opened Mar 20, 2026 by HydraQYH (Contributor)
Add FlashMoE Publication
#3115 opened Mar 20, 2026 by osayamenja
[docs] Fix same typo (label: inactive-30d)
#3098 opened Mar 9, 2026 by lhtin