Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[CuTeDSL] Add SM120 MXF4/NVFP4 native-TMA path
#3273 opened May 25, 2026 by alecco Loading…
Add Blackwell GeForce blockscaled GEMM examples
#3272 opened May 25, 2026 by shubaoyu2 Contributor Loading…
v4.5.2 update.
#3265 opened May 24, 2026 by Junkai-Wu Collaborator Loading…
v4.5.2 update.
#3264 opened May 24, 2026 by Junkai-Wu Collaborator Loading…
Filter SM120 mixed 8-bit tiles for FP6 ElementD
#3247 opened May 19, 2026 by zhils Loading…
fix an intermittent accuracy isse
#3233 opened May 15, 2026 by dishengbin Loading…
W4a8 speedup v2
#3226 opened May 11, 2026 by mak-corp Loading…
Avoid unordered_map for runtime datatype mapping
#3223 opened May 11, 2026 by LwhJesse Loading…
FMHA examples: use cute::min in device functions
#3222 opened May 11, 2026 by LwhJesse Loading…
[examples][CuTeDSL] add MoE dispatch+combine example with NVSHMEM
#3221 opened May 11, 2026 by shubaoyu2 Contributor Loading…
ProTip! What’s not been updated in a month: updated:<2026-04-25.