unsloth-training
- GRPO - RL with reward functions (no labeled outputs needed)
- SFT - Supervised fine-tuning with input/output pairs
- Vision - VLM fine-tuning (Qwen3-VL, Gemma3, Llama 3.2 Vision)
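
To make "RL with reward functions" concrete, here is a minimal sketch of two reward functions. It assumes the TRL-style convention in which a reward function receives a list of completions and returns one float per completion, and it assumes the completions arrive as plain strings. The "Answer: <number>" format, the function names, and the `max_chars` parameter are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical output format: the model is prompted to end with "Answer: <number>".
ANSWER_PATTERN = re.compile(r"Answer:\s*(-?\d+(?:\.\d+)?)\s*$")


def format_reward(completions, **kwargs):
    """Score 1.0 for completions that end with 'Answer: <number>', else 0.0."""
    return [
        1.0 if ANSWER_PATTERN.search(text.strip()) else 0.0
        for text in completions
    ]


def brevity_reward(completions, max_chars=200, **kwargs):
    """Score 0.0 up to max_chars, then a penalty that grows linearly with length."""
    return [
        min(0.0, (max_chars - len(text)) / max_chars)
        for text in completions
    ]


if __name__ == "__main__":
    samples = ["The sum is 4. Answer: 4", "I am not sure."]
    print(format_reward(samples))
    print(brevity_reward(samples))
```

Neither function needs a reference answer: the score comes entirely from properties of the generated text, which is why this style of training can work without labeled outputs.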
Key capabilities:
- FP8 Training - 60% less VRAM, 1.4x faster (RTX 40+, H100)
- 3x Packing - Automatic 2-5x speedup for mixed-length data
- Docker - Official unsloth/unsloth image
- Mobile - QAT → ExecuTorch → iOS/Android (~40 tok/s)
- Export - GGUF, Ollama, vLLM, LM Studio, SGLang
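
The packing item above can be illustrated with a small sketch of why mixed-length data benefits from it. This is not Unsloth's implementation; it is a generic first-fit-decreasing packer over hypothetical sequence lengths, and `pack_lengths` and the sample numbers are assumptions made for the example. A real packer also has to keep packed sequences from attending to each other.

```python
def pack_lengths(lengths, max_seq_length):
    """Group sequence lengths into rows of at most max_seq_length tokens.

    Uses first-fit-decreasing: sort longest first, then place each sequence
    in the first row that still has room, opening a new row if none does.
    """
    rows = []  # each row is a list of sequence lengths sharing one training row
    for length in sorted(lengths, reverse=True):
        if length > max_seq_length:
            raise ValueError(f"sequence of {length} tokens exceeds {max_seq_length}")
        for row in rows:
            if sum(row) + length <= max_seq_length:
                row.append(length)
                break
        else:
            rows.append([length])
    return rows


if __name__ == "__main__":
    # Hypothetical token counts for eight training examples.
    lengths = [1800, 1200, 700, 300, 250, 150, 100, 60]
    max_seq_length = 2048

    rows = pack_lengths(lengths, max_seq_length)
    unpacked_tokens = len(lengths) * max_seq_length  # one padded row per example
    packed_tokens = len(rows) * max_seq_length       # one padded row per pack

    print(f"rows without packing: {len(lengths)}, with packing: {len(rows)}")
    print(f"padded tokens processed: {unpacked_tokens} -> {packed_tokens}")
```

When every example is padded to the same length, short examples spend most of their row on padding. Packing fills that space with other examples, so the gain depends on how uneven the lengths are.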