Stats
Actions
Tags
Help us improve
Share bugs, ideas, or general feedback.
From agent-evaluation-lab
Guides authoring and reviewing red-team eval plugins, attack templates, grader rubrics, safety fixtures, and model-risk test metadata.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-evaluation-lab:red-team-eval-authoringThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
- Adding a new red-team plugin or grader.
Share bugs, ideas, or general feedback.
{ reason, pass, score }.references/redteam-grader-checklist.mdnpx claudepluginhub yeaight7/agent-powerups --plugin agent-evaluation-labGuides technical evaluation of code review feedback: read fully, restate for understanding, verify against codebase, respond with reasoning or pushback before implementing.