Train or fine-tune language models using TRL on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes hardware selection, cost estimation, Trackio monitoring, and Hub persistence.
/plugin marketplace add huggingface/skills/plugin install hugging-face-model-trainer@huggingface-skillsInteractive learning mode that requests meaningful code contributions at decision points (mimics the unshipped Learning output style)
Comprehensive PR review agents specializing in comments, tests, error handling, type design, code quality, and code simplification
Automated code review for pull requests using multiple specialized agents with confidence-based scoring
Comprehensive feature development workflow with specialized agents for codebase exploration, architecture design, and quality review