forked from vllm-project/vllm
Fork 49
Pull requests: ROCm/vllm
Pull requests list
Optimize bf16 wvSplitK_int4 dequant and DOT2C for gfx1151
#909
opened Apr 29, 2026 by
mgehre-amd
Remove .buildkite directory to test CI trigger behavior
#907
opened Apr 28, 2026 by
mgehre-amd
Draft
Support group_size=64 in HybridW4A16 and wvSplitK_int4_g
#905
opened Apr 27, 2026 by
mgehre-amd
[ROCm][DSv4] Share AITER decode dequant + fp8-cast buffers across layers (rebased, stacked on #902)
#903
opened Apr 27, 2026 by
ChuanLi1101
Draft
2 of 4 tasks
[ROCm][DSv4] Make AITER sparse decode cudagraph-clean (rebased, stacked on #901)
#902
opened Apr 27, 2026 by
ChuanLi1101
Draft
2 of 5 tasks
[ROCm][DSv4] AITER-accelerated MLA decode for DeepSeek V4 on MI355X (rebased on tj/dsv4prrebase)
#901
opened Apr 27, 2026 by
ChuanLi1101
Draft
1 of 4 tasks
[Do Not Merge] For review purpose: Rocm/aiter mla dsv4 decode cudagraph
#900
opened Apr 26, 2026 by
tjtanaavllm
Draft
5 tasks
[ROCm] support topk_softplus for all number of experts
#899
opened Apr 25, 2026 by
tjtanaa
5 tasks
[WiP] CI test for automated attention benchmarking suite
#897
opened Apr 24, 2026 by
amd-callumm
Add profiler record_function annotation to HybridW4A16MoEExperts decode
#896
opened Apr 24, 2026 by
roberteg16
1 task done
Tune hybrid_triton_w4a16 prefill kernel for gfx1151
#879
opened Apr 15, 2026 by
mgehre-amd
Draft
3 tasks done
Use fp16 fdot2 for bf16 int4 GEMV dequant on RDNA 3.5
#877
opened Apr 15, 2026 by
mgehre-amd
Draft
Fix Triton WNA16 MoE fallback for CompressedTensorsWNA16MoEMethod
#875
opened Apr 15, 2026 by
mgehre-amd
Draft
Enable FLASH_ATTN backend with upstream flash-attn CK on ROCm
#866
opened Apr 10, 2026 by
mgehre-amd
Draft
1 task
[CI/Build] Fix Dockerfile.rocm_base image build for ROCm 7.2
bug
#863
opened Feb 23, 2026 by
jbelloncastro
3 of 5 tasks
[Triton] fix rope kv_cache for RocmAiterUnifiedAttentionImpl + v.contiguous() remove
stale
#851
opened Jan 2, 2026 by
k50112113