Skip to content

Pull requests: ROCm/TransformerEngine

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Feat] Added JAX-Triton bridge for ROCm
#649 opened Jun 24, 2026 by AllenFarcas Contributor Draft
6 of 13 tasks
Add Triton blockwise FP8 training path for ROCm gfx950 ci-level 3 CI test level 3
#647 opened Jun 22, 2026 by JessicaJiang-123 Loading…
5 of 13 tasks
grouped gemm microbenchmark: use te.GroupedLinear
#639 opened Jun 18, 2026 by matthiasdiener Contributor Loading…
13 tasks
Interleaved Driver Benchmarking
#637 opened Jun 18, 2026 by Micky774 Contributor Draft
13 tasks
[ROCm] Fix biased wgrad with fp32 gradient accumulation ci-level 1 CI test level 1
#634 opened Jun 18, 2026 by XinyuJiangCMU Loading…
Enable MultiCastTranspose for expert weights ci-level 3 CI test level 3
#628 opened Jun 16, 2026 by sudhu2k Contributor Loading…
8 of 13 tasks
Add ROCm HIP small-seq fused attention via crossattn_hip_kernel
#625 opened Jun 15, 2026 by VeeraRajasekhar Contributor Loading…
13 tasks
[CI] Add resilience to artifacts fetch
#622 opened Jun 9, 2026 by leo-automation Collaborator Loading…
[FEAT] Microbenchmark add visualization
#620 opened Jun 8, 2026 by Micky774 Contributor Loading…
13 tasks
Refactored reduction kernels ci-level 3 CI test level 3
#618 opened Jun 8, 2026 by Micky774 Contributor Loading…
13 tasks
Ifu dev 260419 v2.15
#616 opened Jun 8, 2026 by VeeraRajasekhar Contributor Loading…
13 tasks
Incorporate statistical significance testing to benchmarks
#614 opened Jun 8, 2026 by Micky774 Contributor Loading…
13 tasks
Ipanfilo/ci test fixes ci-level 3 CI test level 3
#612 opened Jun 5, 2026 by ipanfilo Collaborator Loading…
1 of 13 tasks
enable blockwise FP8 quantization on rocm ci-level 1 CI test level 1
#609 opened Jun 3, 2026 by asdfvg123 Loading…
1 of 13 tasks
WIP Lightning Indexer + DSA/HCA API
#606 opened Jun 1, 2026 by Micky774 Contributor Draft
1 of 13 tasks
TE AITER gfx1250 integration WIP
#603 opened May 29, 2026 by Micky774 Contributor Draft
13 tasks
Update QoLA/AITER ci-level 3 CI test level 3
#599 opened May 28, 2026 by Micky774 Contributor Loading…
13 tasks
Bump CI retention days ci-level 1 CI test level 1
#591 opened May 20, 2026 by matthiasdiener Contributor Draft
1 of 13 tasks
add production GEMM tests ci-level 1 CI test level 1
#590 opened May 19, 2026 by matthiasdiener Contributor Loading…
1 of 13 tasks
Add Tealite: pure-Python TransformerEngine for ROCm/AMD GPUs
#581 opened May 7, 2026 by jayfurmanek Contributor Loading…
7 of 8 tasks
[ROCm] Allow bf16/bf16/fp32 in nvte_multi_tensor_gemm dispatcher ci-level 1 CI test level 1
#573 opened May 4, 2026 by lizamd Loading…
13 tasks
[No Merge][No Review] testing aiter auto trigger on gh action ci-level 2 CI test level 2
#570 opened May 1, 2026 by VeeraRajasekhar Contributor Draft
13 tasks
HipKittens MXFP8 GEMM Support ci-level 3 CI test level 3
#566 opened Apr 28, 2026 by alextmagro Contributor Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.