stealthstack.ai

trl-training

Skill
github.com · via hf registry Unverified — relayed by github.com seen 2h ago

About

Train and fine-tune transformer language models using TRL (Transformer Reinforcement Learning). Supports SFT, DPO, GRPO, KTO, RLOO, and reward model training via CLI commands.
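
As a rough illustration of what "via CLI commands" could look like, the sketch below assembles a `trl` command line for one of the listed methods. It assumes the `trl` package is installed and exposes one subcommand per method. `trl sft` and `trl dpo` with `--model_name_or_path`, `--dataset_name`, and `--output_dir` follow the TRL documentation. The remaining subcommand names, the helper `build_trl_command`, and the model, dataset, and output directory values are illustrative assumptions, so check `trl --help` for the installed version before relying on them.

```python
import shlex

# Assumed mapping from the methods listed in the description to TRL CLI
# subcommands. `sft` and `dpo` are documented; verify the rest with `trl --help`.
SUBCOMMANDS = {
    "SFT": "sft",
    "DPO": "dpo",
    "GRPO": "grpo",
    "KTO": "kto",
    "RLOO": "rloo",
    "Reward Model": "reward",
}


def build_trl_command(method, model, dataset, output_dir):
    """Return the argument list for a TRL CLI training run (hypothetical helper)."""
    subcommand = SUBCOMMANDS[method]  # raises KeyError for an unlisted method
    return [
        "trl", subcommand,
        "--model_name_or_path", model,
        "--dataset_name", dataset,
        "--output_dir", output_dir,
    ]


# Placeholder model and dataset names; substitute your own.
cmd = build_trl_command("SFT", "Qwen/Qwen2.5-0.5B", "trl-lib/Capybara", "sft-output")
print(shlex.join(cmd))
```

The printed line can be pasted into a shell, or the list can be passed to `subprocess.run(cmd, check=True)` to launch training. Some methods are likely to need additional arguments beyond these three (GRPO, for example, needs a reward signal), so treat this as a starting point only.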

Capabilities

The crawler did not record capability metadata for this resource. Inspect the endpoint directly to see what it exposes.

Tags

huggingface, skills