generated from amazon-archives/__template_Apache-2.0
-
Notifications
You must be signed in to change notification settings - Fork 25
Pull requests: aws-neuron/neuronx-distributed-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add Kokoro-82M TTS contrib: 82M-param text-to-speech on Neuron (trn2 …
#71
opened Mar 12, 2026 by
jimburtoft
Loading…
11 of 14 tasks
Optimize Whisper decoder cross-attention to skip redundant K/V projections during decode
#70
opened Mar 11, 2026 by
jimburtoft
Loading…
contrib: Add Qwen3-Coder-30B-A3B-Instruct (Qwen3MoE) model
#67
opened Mar 10, 2026 by
yahavb
Loading…
7 of 8 tasks
Add Qwen3-Coder-480B-A35B-Instruct contrib: optimized configs for trn…
#66
opened Mar 10, 2026 by
jimburtoft
Loading…
13 of 14 tasks
[Contribution] SolarOpenForCausalLM Support
#65
opened Mar 10, 2026 by
lifelongeeek
Loading…
12 of 14 tasks
Add Boltz-2 contrib model with NKI kernels for pairformer inference
#64
opened Mar 9, 2026 by
jimburtoft
Loading…
11 of 14 tasks
Add MoLFormer molecular transformer contrib model
#62
opened Mar 8, 2026 by
jimburtoft
Loading…
10 of 14 tasks
[Contribution] GLM4MoeForCausalLM Support
#58
opened Mar 6, 2026 by
lifelongeeek
Loading…
5 tasks done
Add Trinity model family (AfmoeForCausalLM) contrib
#55
opened Feb 27, 2026 by
jimburtoft
Loading…
11 of 12 tasks
feat: add expert_wise_scale support for per-expert FP8 quantization in MoE models
#35
opened Feb 13, 2026 by
lifelongeeek
Loading…
8 of 10 tasks
ProTip!
Filter pull requests by the default branch with base:main.