Skip to content

Pull requests: microsoft/onnxruntime-genai

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

WIP: TurboQuant for ORT WebGPU
#2084 opened Apr 14, 2026 by sushraja-msft Contributor Draft
[WebGPU] Support continuous decoding (RewindTo) with graph capture
#2083 opened Apr 13, 2026 by qjia7 Contributor Loading…
extend modelbuilder to build Olmo3, SmolLM3 and other models
#2078 opened Apr 10, 2026 by xadupre Member Loading…
Add onStageComplete
#2074 opened Apr 8, 2026 by apsonawane Contributor Loading…
Enable CUDA graph capture for CUDA EP to improve decode throughput
#2070 opened Apr 7, 2026 by apsonawane Contributor Loading…
Add MIGraphX execution provider support
#2069 opened Apr 5, 2026 by aditya-dl Loading…
Fix: Win32 build failure when paths contain spaces
#2053 opened Apr 1, 2026 by nsubaru Loading…
Add HunYuan Dense V1 (hunyuan_v1_dense) model support
#2045 opened Mar 25, 2026 by amdrajeevp1 Contributor Loading…
[VitisAI] external_ep_library typo fix
#2027 opened Mar 13, 2026 by akholodnamdcom Contributor Loading…
Add Qwen3.5 support
#2025 opened Mar 13, 2026 by kinfey Contributor Loading…
[Don't review] Optimizations for graph capture
#2011 opened Mar 6, 2026 by qjia7 Contributor Draft
GenAI changes to support EPContext compilation and validation
#1993 opened Feb 27, 2026 by lnigam Contributor Loading…
Add model builder support for LFM2
#1979 opened Feb 14, 2026 by xenova Contributor Loading…
[Draft] Parakeet export
#1977 opened Feb 12, 2026 by jiafatom Contributor Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.