Community Plugin

grpo-rl-training

Expert guidance for GRPO/RL fine-tuning with TRL, aimed at reasoning and task-specific model training. Group Relative Policy Optimization (GRPO) scores each sampled completion against the others in its group, so it runs reinforcement learning without a separate critic model. Use it when training models for reasoning, math, coding, or other task-specific improvements.
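To make the "no critic model" point concrete, here is a minimal sketch of the group-relative advantage step that GRPO is named for. It is an illustration, not the plugin's or TRL's implementation: the function name `group_relative_advantages`, the `eps` value, and the use of the population standard deviation are all assumptions chosen for this example.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-4):
    """Normalize rewards within one group of completions for the same prompt.

    Hypothetical helper for illustration only. The group mean acts as the
    baseline that a learned critic would otherwise provide.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    # eps keeps the division finite when every completion gets the same reward
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four completions sampled for one prompt, scored by some reward function.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)

# A group where every completion scores the same carries no learning signal.
flat = group_relative_advantages([0.5, 0.5, 0.5])
```

Completions that score above their group's mean get a positive advantage and those below get a negative one, so the policy update needs only a reward function and several samples per prompt.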

Version 1.0.0

Updated 25 days ago

Install

Add the repository (one-time)

/plugin marketplace add zechenzhangAGI/AI-research-SKILLs

Install the plugin

/plugin install grpo-rl-training@zechenzhangAGI/AI-research-SKILLs

Component Details

No components detected in this plugin's metadata.

Stats

Maintenance: Good

Last Commit: 25 days ago

Similar Plugins

pr-review-toolkit

Comprehensive PR review agents specializing in comments, tests, error handling, type design, code quality, and code simplification

46.0K

code-review

Automated code review for pull requests using multiple specialized agents with confidence-based scoring

agent-sdk-dev

plugin-dev