Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix inference unit test complexity: low
#4589 opened May 2, 2026 by maanug-nv Contributor Loading…
5 tasks
Inference bug-fix: Re-enable EP syncs for the legacy A2A dispatcher complexity: low Final Review PR is in the "final review" stage
#4587 opened May 1, 2026 by sidsingh-nvidia Contributor Loading…
5 tasks
Add opt-in nonuniform tensor parallelism community-request
#4585 opened May 1, 2026 by daiyaanarfeen Loading…
5 tasks done
LanguageModelEmbedding: avoid kernel launch overhead for [b,s,h]→[s,b,h] in inference complexity: low Final Review PR is in the "final review" stage
#4583 opened May 1, 2026 by mathemakitten Contributor Loading…
5 tasks
Consolidate inference example scripts
#4581 opened May 1, 2026 by santhnm2 Contributor Draft
5 tasks
Fix buffers in refit Run tests
#4580 opened May 1, 2026 by wdykas Contributor Draft
5 tasks
Named validation sets complexity: low
#4578 opened May 1, 2026 by RPrenger Contributor Loading…
5 tasks
ci: Test on ephemeral-dev
#4577 opened May 1, 2026 by chtruong814 Contributor Draft
5 tasks
Fix Hang in tests Approved All necessary approvals have been made complexity: low Run tests
#4575 opened May 1, 2026 by wdykas Contributor Loading…
5 tasks
Update energon version requirement complexity: low
#4572 opened May 1, 2026 by maanug-nv Contributor Loading…
5 tasks
Add the CSA/HCA prototype to HybridModel
#4569 opened May 1, 2026 by guihong-nv Contributor Draft
5 tasks done
Add vLLM grouped gemm backend for MoE inference Approved All necessary approvals have been made complexity: high Run functional tests
#4566 opened Apr 30, 2026 by santhnm2 Contributor Loading…
5 tasks
Guard vocab reduce_scatter on TP > 1 Approved All necessary approvals have been made complexity: low
#4565 opened Apr 30, 2026 by mathemakitten Contributor Loading…
5 tasks
Context cpu async schedule
#4564 opened Apr 30, 2026 by lmcafee-nvidia Contributor Draft
5 tasks
Update colwise data after param AG in eval
#4563 opened Apr 30, 2026 by WanZzzzzz Contributor Loading…
5 tasks
Update colwise data after synced param AG complexity: low Final Review PR is in the "final review" stage
#4562 opened Apr 30, 2026 by WanZzzzzz Contributor Loading…
1 of 5 tasks
ci: add cadence input for test filtering in CI workflows
#4561 opened Apr 30, 2026 by balasaajay Contributor Draft
5 tasks
Add async scheduling for dynamic inference
#4558 opened Apr 30, 2026 by lmcafee-nvidia Contributor Draft
1 of 5 tasks
Context cpu async schedule (CD-v3)
#4557 opened Apr 30, 2026 by lmcafee-nvidia Contributor Draft
4 of 5 tasks
Make last_token_logits graphable Approved All necessary approvals have been made complexity: low
#4552 opened Apr 30, 2026 by tdene Contributor Queued
5 tasks
ProTip! no:milestone will show everything without a milestone.