From agentic-toolkit
Validate completed work using deterministic automated checks with 6-category scoring and letter grades (S/A/B/C/D/F)
npx claudepluginhub corbinatorx/devops-ai-toolkit-claude-plugin --plugin agentic-toolkit
This skill is limited to using the following tools:
Objective quality assurance that validates completed work against defined acceptance criteria using deterministic, automated checks. Provides scored feedback with letter grades and actionable improvement suggestions.
This Skill delegates to the manager agent for comprehensive quality validation.
Task paths default to .agentic/tasks/active/. This can be overridden via .claude/config.json:
{
  "documentation": {
    "taskPath": ".agentic/tasks/active"
  }
}
If documentation.taskPath is set in config, use that path instead of the default.
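The override rule above can be sketched as follows. This is a minimal illustration, not the plugin's actual code: the function name `resolve_task_path` is hypothetical, and falling back to the default when the config file is missing or malformed is an assumption.

```python
import json
from pathlib import Path

DEFAULT_TASK_PATH = ".agentic/tasks/active"


def resolve_task_path(config_file: str = ".claude/config.json") -> str:
    """Return documentation.taskPath from the config, else the default path."""
    path = Path(config_file)
    if not path.is_file():
        return DEFAULT_TASK_PATH
    try:
        config = json.loads(path.read_text(encoding="utf-8"))
    except json.JSONDecodeError:
        # Assumption: a malformed config is treated like a missing one.
        return DEFAULT_TASK_PATH
    if not isinstance(config, dict):
        return DEFAULT_TASK_PATH
    documentation = config.get("documentation")
    if not isinstance(documentation, dict):
        return DEFAULT_TASK_PATH
    # An empty or absent taskPath also falls back to the default.
    return documentation.get("taskPath") or DEFAULT_TASK_PATH
```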
This Skill automatically activates when users mention:
"Review task payment-service/phase1#2.1"
"Validate task 3.5"
"Check quality of phase2#4.2"
"Review the completed implementation"
Full path: payment-service/phase1#3.1
Short format: 3.1 (when the service and phase are clear from context)
With description: payment-service/phase1#3.1 Payment Service Implementation
Extract service/feature name, phase number, and task number.
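One way to extract these parts is a pair of regular expressions. The sketch below is illustrative only: `TaskRef` and `parse_task_ref` are hypothetical names, and the patterns assume phases are written as `phase` followed by digits and task numbers as dot-separated digits, as in the examples above.

```python
import re
from typing import NamedTuple, Optional


class TaskRef(NamedTuple):
    service: Optional[str]      # None when omitted (short format)
    phase: Optional[str]        # e.g. "phase1"; None when omitted
    task: str                   # e.g. "3.1"
    description: Optional[str]  # trailing free text, if any


# Full path, optionally followed by a description.
FULL = re.compile(
    r"^(?P<service>[\w.-]+)/(?P<phase>phase\d+)#(?P<task>\d+(?:\.\d+)*)"
    r"(?:\s+(?P<description>.+))?$"
)
# Short format: just the task number, optionally prefixed with the phase.
SHORT = re.compile(r"^(?:(?P<phase>phase\d+)#)?(?P<task>\d+(?:\.\d+)*)$")


def parse_task_ref(text: str) -> TaskRef:
    """Parse a task identifier in full, short, or with-description form."""
    text = text.strip()
    match = FULL.match(text)
    if match:
        return TaskRef(match["service"], match["phase"],
                       match["task"], match["description"])
    match = SHORT.match(text)
    if match:
        return TaskRef(None, match["phase"], match["task"], None)
    raise ValueError(f"unrecognized task identifier: {text!r}")
```

With a short identifier, the missing service and phase would have to come from conversation context, which this sketch leaves to the caller.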
Location: .agentic/tasks/active/{service-or-feature-name}/{phase}.md
Find task section by searching for #### {task-number}.
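The heading search can be sketched as below. It assumes a task section runs from its `#### {task-number}` heading to the next heading of the same or a higher level, which the source does not state explicitly; `find_task_section` is a hypothetical helper name.

```python
import re
from typing import Optional

FENCE = "`" * 3  # Markdown code-fence marker


def find_task_section(phase_markdown: str, task_number: str) -> Optional[str]:
    """Return the text of the '#### {task_number}' section, or None."""
    # The lookahead stops "3.1" from matching "3.10" or "3.1.2".
    heading = re.compile(r"^####\s+" + re.escape(task_number) + r"(?!\d|\.\d)")
    boundary = re.compile(r"^#{1,4}\s")  # next heading at level 1-4
    lines = phase_markdown.splitlines()
    start = None
    in_fence = False
    for i, line in enumerate(lines):
        # Skip fenced code so shell comments like "# Python" are not
        # mistaken for headings.
        if line.lstrip().startswith(FENCE):
            in_fence = not in_fence
            continue
        if in_fence:
            continue
        if start is None:
            if heading.match(line):
                start = i
        elif boundary.match(line):
            return "\n".join(lines[start:i]).rstrip()
    if start is None:
        return None
    return "\n".join(lines[start:]).rstrip()
```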
Checkbox Tasks: Lines starting with - [ ] or - [x]
Acceptance Criteria: Section starting with **Acceptance Criteria:**
Location: File paths in **Location:** annotations
Commands: Build/test commands from task or config
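A rough sketch of pulling these elements out of a task section follows. The exact layout of phase files is not shown in the source, so this assumes acceptance criteria are `- ` list items directly under the `**Acceptance Criteria:**` marker and that each `**Location:**` annotation holds one path; `summarize_task` is a hypothetical name.

```python
import re

CHECKBOX = re.compile(r"^- \[( |x|X)\]\s+(.*)$")
LOCATION = re.compile(r"\*\*Location:\*\*\s*(.+)")
BULLET = re.compile(r"^- (?:\[[ xX]\]\s+)?")


def summarize_task(section: str) -> dict:
    """Collect checkbox tasks, acceptance criteria, and file locations."""
    summary = {"checked": [], "unchecked": [], "criteria": [], "locations": []}
    in_criteria = False
    for line in section.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.startswith("**Acceptance Criteria:**"):
            in_criteria = True
            continue
        location = LOCATION.search(stripped)
        if location:
            # Paths are often wrapped in backticks; strip them if present.
            summary["locations"].append(location.group(1).strip().strip("`"))
        if in_criteria and stripped.startswith("- "):
            summary["criteria"].append(BULLET.sub("", stripped))
            continue
        in_criteria = False  # any non-list line ends the criteria block
        box = CHECKBOX.match(stripped)
        if box:
            key = "unchecked" if box.group(1) == " " else "checked"
            summary[key].append(box.group(2))
    return summary
```

A non-empty `unchecked` list is the signal the "Task Not Implemented" error described later relies on.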
Invoke the manager agent to run automated checks and validation. The agent will:
Read build command from:
.claude/config.json under testing.commands.build
Tech stack defaults:
dotnet build
npm run build or npm run type-check
python -m py_compile or mypy .
go build ./...
mvn compile or gradle build
Run test command and capture:
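As an illustration of running a build or test command and capturing its result, here is a minimal sketch. The fields captured (exit code, combined output, duration) are an assumption, since the source does not list them, and `build_command`, `run_check`, and `CheckResult` are hypothetical names.

```python
import shlex
import subprocess
import time
from dataclasses import dataclass


@dataclass
class CheckResult:
    command: str
    passed: bool
    exit_code: int
    output: str
    seconds: float


def build_command(config: dict, default: str) -> str:
    """Prefer testing.commands.build from the config, else a stack default."""
    commands = config.get("testing", {}).get("commands", {})
    return commands.get("build") or default


def run_check(command: str, cwd: str = ".", timeout: int = 600) -> CheckResult:
    """Run one command without a shell and record how it went."""
    started = time.monotonic()
    try:
        proc = subprocess.run(shlex.split(command), cwd=cwd,
                              capture_output=True, text=True, timeout=timeout)
        exit_code, output = proc.returncode, proc.stdout + proc.stderr
    except FileNotFoundError:
        # 127 mirrors the shell convention for "command not found".
        exit_code, output = 127, f"command not found: {command}"
    except subprocess.TimeoutExpired:
        # 124 mirrors the exit code of the coreutils `timeout` utility.
        exit_code, output = 124, f"timed out after {timeout}s: {command}"
    return CheckResult(command, exit_code == 0, exit_code, output,
                       time.monotonic() - started)
```

For example, `run_check(build_command(config, "npm run build"))` would use the configured build command when one is set and the Node.js default otherwise.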
Type checking (TypeScript, and Python with type hints):
# TypeScript
tsc --noEmit
# Python
mypy src/
Linting and formatting:
# JavaScript/TypeScript
npm run lint
# Python
flake8 src/
# .NET
dotnet format --verify-no-changes
Dependency security audit:
# Node.js
npm audit
# Python
safety check
For each criterion:
IMPORTANT: Always write to: .agentic/tasks/active/{service-or-feature}/{phase}_status.md
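The phase and status file locations follow a fixed pattern, sketched below. The helper names are hypothetical, and whether a `documentation.taskPath` override also applies to the status file is an assumption; the `task_root` parameter exists only to make that choice explicit.

```python
from pathlib import PurePosixPath

DEFAULT_TASK_ROOT = ".agentic/tasks/active"


def phase_file_path(service: str, phase: str,
                    task_root: str = DEFAULT_TASK_ROOT) -> str:
    """Path of the phase file that holds the task definitions."""
    return str(PurePosixPath(task_root) / service / f"{phase}.md")


def status_file_path(service: str, phase: str,
                     task_root: str = DEFAULT_TASK_ROOT) -> str:
    """Path of the status report written next to the phase file."""
    return str(PurePosixPath(task_root) / service / f"{phase}_status.md")
```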
6 Categories (0-20 points each, 100 total):
1. Completeness
2. Code Quality
3. Architecture
4. Security
5. Testing
6. Documentation
Total Score → Letter Grade:
The manager generates a comprehensive status file:
# Phase {N} Implementation Status Report
## {Feature Name} - {Phase Title}
**Review Date:** {YYYY-MM-DD}
**Phase Status:** {✅ IMPLEMENTATION COMPLETE | ⚠️ NEEDS REWORK | ⏸️ IN PROGRESS}
**Overall Score:** {score}/100
**Letter Grade:** {S | A | B | C | D | F}
**Reviewer:** Manager (Automated Quality Assurance)
---
## Scoring Breakdown
| Category | Score | Grade | Status |
|----------|-------|-------|--------|
| Completeness | {X}/20 | {letter} | {✅/⚠️/⏸️} |
| Code Quality | {X}/20 | {letter} | {✅/⚠️/⏸️} |
| Architecture | {X}/20 | {letter} | {✅/⚠️/⏸️} |
| Security | {X}/20 | {letter} | {✅/⚠️/⏸️} |
| Testing | {X}/20 | {letter} | {✅/⚠️/⏸️} |
| Documentation | {X}/20 | {letter} | {✅/⚠️/⏸️} |
| **TOTAL** | **{X}/100** | **{letter}** | **{status}** |
---
## Detailed Assessment
### 1. Completeness ({X}/20) - {Grade}
{checkbox completion, acceptance criteria status, issues}
### 2. Code Quality ({X}/20) - {Grade}
{strengths, issues, linting results}
### 3. Architecture ({X}/20) - {Grade}
{blueprint adherence, structural issues}
### 4. Security ({X}/20) - {Grade}
{security checks, vulnerabilities}
### 5. Testing ({X}/20) - {Grade}
{test results, coverage analysis}
### 6. Documentation ({X}/20) - {Grade}
{documentation present, missing docs}
---
## Tech Debt Identified
### High Priority
- [ ] {critical issue to address}
### Medium Priority
- [ ] {important improvement}
### Low Priority
- [ ] {nice-to-have improvement}
---
## Recommendations
### Must Fix (Blocking Issues)
1. {critical issue}
### Should Fix (Important)
1. {important issue}
### Could Improve (Optional)
1. {enhancement}
---
## Conclusion
**Overall Assessment**: {summary paragraph}
**Next Steps**: {action items}
**Ready for Production**: {Yes | No | With Caveats}
The Manager (via manager agent):
The Manager does NOT:
Pass Threshold: 75 points (B grade) or higher
Production Ready: 85 points (A grade) or higher
Rework Required: Below 65 points (C grade)
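These thresholds map onto a simple classification, sketched below. The outcome labels are hypothetical, and the source does not say how scores from 65 to 74 are treated, so the sketch reports that band separately rather than guessing.

```python
def review_outcome(total_score: int) -> str:
    """Classify a 0-100 total score against the documented thresholds."""
    if not 0 <= total_score <= 100:
        raise ValueError(f"score out of range: {total_score}")
    if total_score >= 85:
        return "production-ready"      # A grade or higher
    if total_score >= 75:
        return "pass"                  # B grade
    if total_score >= 65:
        # Assumption: this band is neither a pass nor mandatory rework.
        return "below-pass-threshold"
    return "rework-required"           # below 65 points
```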
Validates work from:
implement-task Skill
Feeds back to:
Used in development cycle:
1. /blueprint - Architect creates architecture
2. /blueprint-tasks - Convert to tasks
3. /implement-task - Builder implements
4. /review-task - Manager validates ← YOU ARE HERE
5. [If review fails, rerun /implement-task with fixes]
6. /commit - Create commit
7. [Repeat for next task]
8. /create-pr - Create PR
Reads .claude/config.json for:
❌ Phase File Not Found
Phase file not found at: .agentic/tasks/active/{service}/{phase}.md
Cannot review task without phase file.
Troubleshooting:
- Verify service/feature name is correct
- Check phase number is correct
- Ensure task has been implemented
❌ Task Not Implemented
Task {task-number} appears incomplete (unchecked checkboxes).
Suggestion: Implement the task first with /implement-task {task-number}
⚠️ Automated Checks Failed
**Build Status**: ❌ Failed
**Tests**: ⚠️ {X} tests failed
The review will continue but these failures will result in low scores.
Continuing with review...