-
Notifications
You must be signed in to change notification settings - Fork 4.9k
Pull requests: deepspeedai/DeepSpeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Feat: zero3 deprecate elastic checkpoint
#8099
opened Jun 30, 2026 by
nathon-lee
Contributor
Loading…
Warn when zero.Init silently falls back to a single rank (#8084)
#8089
opened Jun 24, 2026 by
akshansh47
Loading…
fix: use local ev_values and wrap dict.values() in list()
#8087
opened Jun 23, 2026 by
hashwnath
Loading…
3 tasks done
ZeRO 1/2: wait on all IPG-bucket producer streams in average_tensor (#8061)
#8080
opened Jun 19, 2026 by
arunshar
Contributor
Loading…
feat: add Trackio as a new experiment monitoring backend
#8065
opened Jun 15, 2026 by
chanduripranav
Loading…
[DeepCompile] fix gather params in dynamo skipped frames for ZeRO3
#8059
opened Jun 11, 2026 by
XAheli
Loading…
7 tasks done
feat(zenflow): run the overlapped CPU optimizer in a native process
#8058
opened Jun 10, 2026 by
Antlera
Collaborator
Loading…
Fix eigenvalue parsing for compression-only quantize configs
#8057
opened Jun 10, 2026 by
sowndappan5
Contributor
Loading…
Add optional torchembed RoPE backend to apply_rotary_pos_emb
#8052
opened Jun 7, 2026 by
py-ai-dev
Loading…
Fix minor comment/docstring typos in runtime and inference modules
#8046
opened Jun 3, 2026 by
nathon-lee
Contributor
Loading…
zero3: defer param release during retain_graph backward #7352
#8045
opened Jun 3, 2026 by
nathon-lee
Contributor
Loading…
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
#8027
opened May 26, 2026 by
PKUWZP
Collaborator
Loading…
3 of 5 tasks
Refactor/torch autocast encapsulate global state
#7946
opened Apr 2, 2026 by
nathon-lee
Contributor
Loading…
Fix ZeRO-3 optimizer initialization validation (#7844)
#7929
opened Mar 28, 2026 by
amadhan882
Loading…
doc: Remove suggestion to build extensions in parallel
#7899
opened Mar 12, 2026 by
Flamefire
Contributor
Loading…
Fix Stage 0 + Ulysses crash: make bwc_tensor_model_parallel_rank() resilient to MP API absence
#7888
opened Mar 6, 2026 by
nathon-lee
Contributor
Loading…
fix(zero): Ensure full gradient reduction for Muon optimizer with reduce_scatter
#7878
opened Feb 27, 2026 by
nathon-lee
Contributor
Loading…
fix: correct DistributedAttention output shape and pad uneven sequence lengths (#7842)
#7868
opened Feb 22, 2026 by
harshang03
•
Draft
fix: keep fp32-pinned parameters out of the bf16 cast path in ZeRO-3 (#7747)
#7867
opened Feb 22, 2026 by
harshang03
•
Draft
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-05-30.