fine-tuning-with-trl

Originally fromovachiever/droid-tings
Installation
SKILL.md

TRL - Transformer Reinforcement Learning

Quick start

TRL provides post-training methods for aligning language models with human preferences.

Installation:

pip install trl transformers datasets peft accelerate

Supervised Fine-Tuning (instruction tuning):

from trl import SFTTrainer
Installs
100
GitHub Stars
10.1K
First Seen
Jan 21, 2026
fine-tuning-with-trl — zechenzhangagi/ai-research-skills