slime-rl-training

Originally from davila7/claude-code-templates

slime: LLM Post-Training Framework for RL Scaling

slime is an LLM post-training framework from Tsinghua's THUDM team, powering GLM-4.5, GLM-4.6, and GLM-4.7. It connects Megatron-LM for training with SGLang for high-throughput rollout generation.
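The split between a rollout engine and a trainer can be pictured with a small, purely illustrative sketch. Nothing below is slime's actual API: `Sample`, `DataBuffer`, `rollout`, `train_step`, and `run` are hypothetical stand-ins, with `rollout` playing the role of the inference side (SGLang in slime) and `train_step` playing the role of the training side (Megatron-LM in slime). It only shows the general generate, buffer, train loop, under those assumptions.

```python
import random
from collections import deque
from dataclasses import dataclass


@dataclass
class Sample:
    """One generated rollout with its scalar reward (hypothetical type)."""
    prompt: str
    response: str
    reward: float


class DataBuffer:
    """Hypothetical FIFO buffer sitting between rollout and training."""

    def __init__(self, capacity: int) -> None:
        self._items = deque(maxlen=capacity)

    def add(self, samples) -> None:
        self._items.extend(samples)

    def take(self, n: int):
        # Pop up to n of the oldest samples.
        return [self._items.popleft() for _ in range(min(n, len(self._items)))]

    def __len__(self) -> int:
        return len(self._items)


def rollout(prompts, policy_version: int, rng: random.Random):
    """Stand-in for the inference engine: fake responses, random rewards."""
    return [Sample(p, f"response-v{policy_version}", rng.random()) for p in prompts]


def train_step(batch, policy_version: int):
    """Stand-in for the trainer: bump the policy version, report mean reward."""
    mean_reward = sum(s.reward for s in batch) / len(batch)
    return policy_version + 1, mean_reward


def run(num_iterations: int = 3, batch_size: int = 4, seed: int = 0):
    rng = random.Random(seed)
    buffer = DataBuffer(capacity=64)
    prompts = [f"prompt-{i}" for i in range(batch_size)]
    version = 0
    history = []
    for _ in range(num_iterations):
        buffer.add(rollout(prompts, version, rng))   # generation phase
        batch = buffer.take(batch_size)              # hand data to the trainer
        version, mean_reward = train_step(batch, version)
        history.append((version, mean_reward))
    return history, len(buffer)


if __name__ == "__main__":
    history, remaining = run()
    for version, mean_reward in history:
        print(f"policy v{version}: mean reward {mean_reward:.3f}")
    print(f"samples left in buffer: {remaining}")
```

In this toy version the buffer is a plain FIFO queue; a real setup would likely add filtering, custom sampling, or off-policy handling at that point, which is where a flexible data buffer becomes useful.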

When to Use slime

Choose slime when you need:

Megatron-LM native training with SGLang inference
Custom data generation workflows with flexible data buffers
Training GLM, Qwen3, DeepSeek V3, or Llama 3 models
Research-grade framework with production backing (Z.ai)

Consider alternatives when:

You need enterprise-grade stability features → use miles
You want flexible backend swapping → use verl
You need PyTorch-native abstractions → use torchforge

Key Features

Repository

kiterlin/intell…n-system

First Seen

Apr 21, 2026
