tinker
Installation
SKILL.md
Tinker API - LLM Fine-Tuning
Overview
Tinker is a training API for large language models from Thinking Machines Lab. It provides:
- Supervised Fine-Tuning (SFT): Train models on instruction/completion pairs
- Reinforcement Learning (RL): PPO and policy gradient losses; cookbook patterns include GRPO-like group rollouts/advantage centering
- Vision-Language Models: VLM support via Qwen3-VL
- LoRA Training: Efficient parameter-efficient fine-tuning
Two abstraction levels:
- Tinker Cookbook: High-level patterns with automatic training loops
- Low-Level API: Manual control for custom training logic