Add slime RL post-training image by abatilo · Pull Request #125 · coreweave/ml-containers

abatilo · 2026-02-24T15:47:28Z

WIP: Add slime RL post-training image

Adds Docker build infrastructure for THUDM/slime, an RL post-training framework that coordinates Megatron-LM (training) + SGLang (rollout inference) via Ray.

What this does

Layers the slime training stack on top of the existing sglang image:

Component	Version	Why
TransformerEngine	2.10.0	Upgraded from base 2.4 (slime requirement)
Apex	`10417ace`	Slime-pinned commit (different from torch-extras)
SGLang (patched)	`4b6f62e2`	42-file patch for RL weight sync, scheduling, memory management
Megatron-LM (patched)	`3714d81d`	17-file patch for MoE routing replay, sandwich norm, MTP fixes
slime	`b964eedc`	RL post-training framework + int4_qat CUDA kernel

Architecture

Follows the established ml-containers two-stage build pattern:

sglang image (base)
  └── builder stage: build.bash compiles TE 2.10, apex, patched sglang, int4_qat → /wheels/
  └── final stage: install.bash installs wheels + Megatron-LM (editable) + slime (editable)

Inherits Blackwell sm_100a support and multi-arch (amd64+arm64) from the base sglang image.

TODO

Validate with a real docker build against the latest sglang image
Smoke test: import slime, megatron, sglang, transformer_engine, apex
Confirm arm64 build works
Update default base image tag in workflow once latest sglang is built

abatilo · 2026-02-24T15:47:30Z

This change is part of the following stack:

feat(sglang): upgrade to v0.5.9 on plain torch base for SLIME support #128
- Add slime RL post-training image #125 ◀

_{Change managed by git-spice.}

github-actions · 2026-02-24T16:34:46Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22359629088
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-6bbebab-d6cea4b-386fabe-nccl-cuda12.8.0-ubuntu22.04-nccl2.25.1-1-torch2.6.0-vision0.21.0-audio2.6.0-abi1

github-actions · 2026-02-25T10:53:56Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22393112086
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-c469ae4-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-25T14:37:46Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22401124059
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-b896655-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-25T15:51:02Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22393112087
Image: ghcr.io/coreweave/ml-containers/sglang:abatilo-slime-c469ae4-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-25T15:51:29Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22403841820
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-2a176de-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-25T17:02:52Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22406520501
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-9b08cb3-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-25T21:31:16Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22416790591
Image: ghcr.io/coreweave/ml-containers/sglang:abatilo-slime-79ebda6-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-25T21:40:33Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22416790628
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-79ebda6-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-25T22:07:58Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22417654701
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-67b6432-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-25T22:08:37Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22417446269
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-07a6203-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-27T17:08:14Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22495614431
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-8403328-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-27T22:37:56Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22495614471
Image: ghcr.io/coreweave/ml-containers/sglang:abatilo-slime-8403328-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-27T23:59:36Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22508310210
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-029b183-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-02-28T04:03:50Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22512778429
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-006aa6e-abatilo-sglang-d8259f9-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-03-03T23:41:45Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22647629481
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-a58dea5-abatilo-sglang-d5fb1dd-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-03-04T16:38:25Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22678705175
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-fbbd7f8-abatilo-sglang-d5fb1dd-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-03-05T03:22:10Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22700394547
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-8891b14-abatilo-sglang-d5fb1dd-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-03-05T04:15:20Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22701505825
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-5faef68-abatilo-sglang-d5fb1dd-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

github-actions · 2026-03-05T18:09:43Z

@abatilo Build complete, success: https://github.com/coreweave/ml-containers/actions/runs/22729124486
Image: ghcr.io/coreweave/ml-containers/slime:abatilo-slime-e7cf547-abatilo-sglang-d5fb1dd-nccl-cuda12.9.1-ubuntu22.04-nccl2.29.2-1-torch2.10.0-vision0.25.0-audio2.10.0-abi1

SLIME's post-training pipeline (Megatron + SGLang) requires SGLang v0.5.9. This upgrades from v0.4.x and rebases the image onto a plain torch base, dropping the torch-extras layer (DeepSpeed, Apex, xFormers) that neither SGLang nor SLIME actually uses. FlashInfer moves from JIT to v0.6.3 AOT compilation via TVM. sgl-kernel is now built with scikit-build-core and enables SM100A (Blackwell) and FP4 support. vLLM and Triton are removed from this image since they are served by the dedicated vllm-tensorizer image.

SLIME combines Megatron-LM and SGLang for reinforcement learning based post-training of large language models. This image builds TransformerEngine 2.10, Apex, and a patched SGLang wheel on top of our sglang base, then installs Megatron-LM with SLIME's patches for routing replay and memory management. Patches are versioned under slime/patches/v0.5.7/ and documented in README.md with their upstream origins from THUDM/SLIME.

abatilo force-pushed the abatilo/slime branch from 6bbebab to c469ae4 Compare February 25, 2026 10:37

This was referenced Feb 25, 2026

feat(slime): replace sglang+slime workflows with buildx bake chain #126

Closed

feat(sglang): upgrade to v0.5.9 on plain torch base for SLIME support #128

Open

abatilo changed the base branch from main to abatilo/sglang February 25, 2026 19:16

abatilo force-pushed the abatilo/sglang branch from d8259f9 to 2642fae Compare February 25, 2026 21:30

abatilo force-pushed the abatilo/slime branch from 9b08cb3 to 79ebda6 Compare February 25, 2026 21:30

abatilo force-pushed the abatilo/slime branch from 67b6432 to 8403328 Compare February 27, 2026 16:58

abatilo force-pushed the abatilo/slime branch from 8403328 to e1231a4 Compare February 27, 2026 23:33

abatilo force-pushed the abatilo/sglang branch from f074022 to df08c06 Compare February 27, 2026 23:38

abatilo force-pushed the abatilo/slime branch from e1231a4 to 0bef9f4 Compare February 27, 2026 23:38

abatilo force-pushed the abatilo/sglang branch from df08c06 to fd74de2 Compare February 27, 2026 23:47

abatilo force-pushed the abatilo/slime branch from 0bef9f4 to 029b183 Compare February 27, 2026 23:47

abatilo force-pushed the abatilo/sglang branch from fd74de2 to 38f8a15 Compare February 28, 2026 03:52

abatilo force-pushed the abatilo/slime branch from 029b183 to 006aa6e Compare February 28, 2026 03:52

abatilo force-pushed the abatilo/slime branch from 006aa6e to b34dc7a Compare February 28, 2026 19:59

abatilo force-pushed the abatilo/slime branch from 94f34fd to a58dea5 Compare March 3, 2026 23:30

abatilo force-pushed the abatilo/slime branch from a58dea5 to fbbd7f8 Compare March 4, 2026 16:27

abatilo force-pushed the abatilo/slime branch 4 times, most recently from dffa047 to 8891b14 Compare March 5, 2026 03:08

abatilo force-pushed the abatilo/slime branch 2 times, most recently from 10fe935 to 5faef68 Compare March 5, 2026 03:55

abatilo force-pushed the abatilo/slime branch 2 times, most recently from 597ffe5 to e7cf547 Compare March 5, 2026 17:38

abatilo force-pushed the abatilo/sglang branch from d5fb1dd to 17f8fd0 Compare March 5, 2026 22:47

abatilo force-pushed the abatilo/slime branch from e7cf547 to 7629697 Compare March 5, 2026 22:49

abatilo force-pushed the abatilo/sglang branch from 17f8fd0 to a8a4833 Compare March 6, 2026 22:12

abatilo force-pushed the abatilo/slime branch from 7629697 to e8eb91f Compare March 6, 2026 22:12

abatilo force-pushed the abatilo/sglang branch from a8a4833 to eeadc43 Compare March 7, 2026 00:37

abatilo force-pushed the abatilo/slime branch from e8eb91f to ef1fc27 Compare March 7, 2026 00:37

abatilo force-pushed the abatilo/slime branch from ef1fc27 to 324e8ec Compare March 7, 2026 06:46

abatilo force-pushed the abatilo/sglang branch 6 times, most recently from d246ce3 to 34f06f0 Compare March 9, 2026 14:37

Conversation

abatilo commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

WIP: Add slime RL post-training image

What this does

Architecture

TODO

Uh oh!

abatilo commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Feb 24, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 25, 2026

Uh oh!

github-actions bot commented Feb 27, 2026

Uh oh!

github-actions bot commented Feb 27, 2026

Uh oh!

github-actions bot commented Feb 27, 2026

Uh oh!

github-actions bot commented Feb 28, 2026

Uh oh!

github-actions bot commented Mar 3, 2026

Uh oh!

github-actions bot commented Mar 4, 2026

Uh oh!

github-actions bot commented Mar 5, 2026

Uh oh!

github-actions bot commented Mar 5, 2026

Uh oh!

github-actions bot commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

abatilo commented Feb 24, 2026 •

edited

Loading

abatilo commented Feb 24, 2026 •

edited

Loading