-
Notifications
You must be signed in to change notification settings - Fork 267
Pull requests: microsoft/onnxruntime-genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Route pipeline model RunOptions through SetRunOption for proper special key handling
#2044
opened Mar 24, 2026 by
Copilot
AI
Loading…
Add Qwen3.5 hybrid decoder export support (GatedDeltaNet + Attention)
#2043
opened Mar 24, 2026 by
apsonawane
Loading…
Rename NemotronCacheConfig to NemotronConfig and add blank penalty to the decoder
#2042
opened Mar 22, 2026 by
nenad1002
Loading…
Decouple plugin execution providers (EPs) from the USE_WINML pre-processor macro
#2038
opened Mar 19, 2026 by
baijumeswani
Loading…
Add WebGPU EP support and repetitions flag to whisper.py
#2032
opened Mar 17, 2026 by
qjia7
Loading…
GenAI changes to support EPContext compilation and validation
#1993
opened Feb 27, 2026 by
lnigam
Loading…
remove one assert not verified with model microsoft/OptiMind-SFT
#1975
opened Feb 12, 2026 by
xadupre
Loading…
Also clear provider_options when ClearProviders is called
#1894
opened Nov 26, 2025 by
Zhaeong
Loading…
Enable GENAI with
FetchContent or find_package() for client projects
#1858
opened Nov 9, 2025 by
apwojcik
Loading…
Modify Model Builder to build paged attention models
#1605
opened Jul 3, 2025 by
aciddelgado
•
Draft
Previous Next
ProTip!
Follow long discussions with comments:>50.