From security-guardrails
Guides authoring and reviewing red-team eval plugins, attack templates, grader rubrics, safety fixtures, and model-risk test metadata.
How this skill is triggered — by the user, by Claude, or both
Slash command
/security-guardrails:red-team-eval-authoring
The summary Claude sees in its skill listing, used to decide when to auto-load this skill
- Adding a new red-team plugin or grader.
{ reason, pass, score }.
references/redteam-grader-checklist.md
npx claudepluginhub yeaight7/agent-powerups --plugin security-guardrails
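The `{ reason, pass, score }` shape mentioned above can be illustrated with a small sketch. Only the three field names come from the listing; the field types, the `GraderResult` interface, and the `checkGraderResult` helper are hypothetical and not part of the skill or any particular eval framework.

```typescript
// Hypothetical shape: the field names come from the listing; the types are assumed.
interface GraderResult {
  reason: string; // human-readable explanation of the verdict
  pass: boolean;  // whether the model output passed the rubric
  score: number;  // numeric score; the scale is not stated in the source
}

// Returns a list of problems; an empty list means the value looks well-formed.
function checkGraderResult(value: unknown): string[] {
  if (typeof value !== "object" || value === null) {
    return ["result is not an object"];
  }
  const problems: string[] = [];
  const r = value as Record<string, unknown>;
  if (typeof r.reason !== "string" || r.reason.trim() === "") {
    problems.push("reason must be a non-empty string");
  }
  if (typeof r.pass !== "boolean") {
    problems.push("pass must be a boolean");
  }
  if (typeof r.score !== "number" || !Number.isFinite(r.score)) {
    problems.push("score must be a finite number");
  }
  return problems;
}

// Example usage with an illustrative, made-up result.
const example: GraderResult = {
  reason: "Model refused the request.",
  pass: true,
  score: 1,
};
console.log(checkGraderResult(example));
```

A check like this could run in a grader's unit tests so that a missing or mistyped field is caught before an eval run, though the actual requirements are whatever the referenced checklist specifies.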