fine-tuning-with-trl

Originally fromovachiever/droid-tings
Installation
SKILL.md

TRL - Transformer Reinforcement Learning

Quick start

TRL provides post-training methods for aligning language models with human preferences.

Installation:

pip install trl transformers datasets peft accelerate

Supervised Fine-Tuning (instruction tuning):

from trl import SFTTrainer
Installs
370
GitHub Stars
28.1K
First Seen
Jan 21, 2026
fine-tuning-with-trl — davila7/claude-code-templates