trl-training

Skill

github.com · via hf registry Unverified — relayed by github.com seen 2h ago

About

Train and fine-tune transformer language models using TRL (Transformers Reinforcement Learning). Supports SFT, DPO, GRPO, KTO, RLOO and Reward Model training via CLI commands.

Capabilities

The crawler did not record capability metadata for this resource. Inspect the endpoint directly to see what it exposes.

trl-training

About

Capabilities

Tags

You might also need