-
Notifications
You must be signed in to change notification settings - Fork 3.9k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix inference unit test
complexity: low
#4589
opened May 2, 2026 by
maanug-nv
Contributor
Loading…
5 tasks
Inference bug-fix: Re-enable EP syncs for the legacy A2A dispatcher
complexity: low
Final Review
PR is in the "final review" stage
#4587
opened May 1, 2026 by
sidsingh-nvidia
Contributor
Loading…
5 tasks
observability: add metrics instrumentation to DataParallelInferenceCoordinator
community-request
#4586
opened May 1, 2026 by
DhineshPonnarasan
Contributor
Loading…
Add opt-in nonuniform tensor parallelism
community-request
#4585
opened May 1, 2026 by
daiyaanarfeen
Loading…
5 tasks done
LanguageModelEmbedding: avoid kernel launch overhead for [b,s,h]→[s,b,h] in inference
complexity: low
Final Review
PR is in the "final review" stage
#4583
opened May 1, 2026 by
mathemakitten
Contributor
Loading…
5 tasks
Named validation sets
complexity: low
#4578
opened May 1, 2026 by
RPrenger
Contributor
Loading…
5 tasks
Fix Hang in tests
Approved
All necessary approvals have been made
complexity: low
Run tests
#4575
opened May 1, 2026 by
wdykas
Contributor
Loading…
5 tasks
Update energon version requirement
complexity: low
#4572
opened May 1, 2026 by
maanug-nv
Contributor
Loading…
5 tasks
Reduce rollout broadcast memory usage and add safe optional torch.distributed import handling
community-request
waiting-on-customer
Waiting on the original author to respond
#4571
opened May 1, 2026 by
brukcodes
Loading…
5 tasks
Enable shared expert overlap with allgatherv in inference
complexity: low
#4570
opened May 1, 2026 by
sidsingh-nvidia
Contributor
Loading…
5 tasks
Add the CSA/HCA prototype to HybridModel
#4569
opened May 1, 2026 by
guihong-nv
Contributor
•
Draft
5 tasks done
Add vLLM grouped gemm backend for MoE inference
Approved
All necessary approvals have been made
complexity: high
Run functional tests
#4566
opened Apr 30, 2026 by
santhnm2
Contributor
Loading…
5 tasks
Guard vocab reduce_scatter on TP > 1
Approved
All necessary approvals have been made
complexity: low
#4565
opened Apr 30, 2026 by
mathemakitten
Contributor
Loading…
5 tasks
Update colwise data after param AG in eval
#4563
opened Apr 30, 2026 by
WanZzzzzz
Contributor
Loading…
5 tasks
Update colwise data after synced param AG
complexity: low
Final Review
PR is in the "final review" stage
#4562
opened Apr 30, 2026 by
WanZzzzzz
Contributor
Loading…
1 of 5 tasks
ci: add cadence input for test filtering in CI workflows
#4561
opened Apr 30, 2026 by
balasaajay
Contributor
•
Draft
5 tasks
Add async scheduling for dynamic inference
#4558
opened Apr 30, 2026 by
lmcafee-nvidia
Contributor
•
Draft
1 of 5 tasks
Context cpu async schedule (CD-v3)
#4557
opened Apr 30, 2026 by
lmcafee-nvidia
Contributor
•
Draft
4 of 5 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.