-
Notifications
You must be signed in to change notification settings - Fork 281
Pull requests: microsoft/onnxruntime-genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add MIGraphX execution provider support in Model Builder
#2086
opened Apr 14, 2026 by
ndevarapa
Loading…
Reduce CPU-side per-token overhead in GenerateNextToken and SampleTopP
#2085
opened Apr 14, 2026 by
hanbitmyths
Collaborator
Loading…
[WebGPU] Support continuous decoding (RewindTo) with graph capture
#2083
opened Apr 13, 2026 by
qjia7
Contributor
Loading…
[Don't merge it] Fix quark quantize weight loading for Qwen3-VL-4B text model
#2082
opened Apr 13, 2026 by
Tianping-amd
Loading…
extend modelbuilder to build Olmo3, SmolLM3 and other models
#2078
opened Apr 10, 2026 by
xadupre
Member
Loading…
Add Mistral3/Pixtral VLM support: image processor, model builder, parity tests
#2077
opened Apr 8, 2026 by
titaiwangms
Loading…
Fix YaRN RoPE bugs in model builder and add parity tests
#2076
opened Apr 8, 2026 by
titaiwangms
Loading…
Enable CUDA graph capture for CUDA EP to improve decode throughput
#2070
opened Apr 7, 2026 by
apsonawane
Contributor
Loading…
Fix CUDA build with MSVC by enabling /Zc:preprocessor for nvcc host compilation on VS 16.5 or greater
#2054
opened Apr 1, 2026 by
nsubaru
Loading…
Add HunYuan Dense V1 (hunyuan_v1_dense) model support
#2045
opened Mar 25, 2026 by
amdrajeevp1
Contributor
Loading…
Rename NemotronCacheConfig to NemotronConfig and add blank penalty to the decoder
#2042
opened Mar 22, 2026 by
nenad1002
Contributor
Loading…
[VitisAI] external_ep_library typo fix
#2027
opened Mar 13, 2026 by
akholodnamdcom
Contributor
Loading…
GenAI changes to support EPContext compilation and validation
#1993
opened Feb 27, 2026 by
lnigam
Contributor
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.