From autoworker
Final quality gate before reporting task completion (Gate 3). Fills confidence assessment, runs supplementary verification for <95% items, completes self-check checklist. Writes PASS/FAIL result to subtask.md, then calls autoworker:dispatch for routing.
npx claudepluginhub phj128/autoworker --plugin autoworkerThis skill uses the workspace's default tool permissions.
**Trigger**: Called by autoworker:dispatch when all tests are complete. **Pure assessment skill — does not make routing decisions.**
Complete subtask verification plan: upstream traceability table + L1-L4 test plan + self-check. Call after autoworker:subtask-init. Makes subtask ready for code implementation.
Validates AI agent claims against evidence trail in coding workflows. Catches unsubstantiated 'done', 'tests pass', 'fixed' without proof like outputs, diffs, or logs. Auto-triggers on completion keywords.
Enforces evidence-based verification before claiming tasks, features, or PRs complete. Requires pasting test outputs, command runs, and behavioral checks; rejects vague assertions.
Share bugs, ideas, or general feedback.
Trigger: Called by autoworker:dispatch when all tests are complete. Pure assessment skill — does not make routing decisions.
Glob `subtask_*.md` (exclude subtask_template.md) →
0 found → stop, prompt to create subtask
1 found → use directly (backward compatible)
multiple → grep `status:` to filter:
- Files without status field treated as active (backward compatible)
- Exactly 1 active → use it
- 0 active → list all files + status, prompt user to choose
- >1 active → report anomaly
→ Read → check "Test Results" section
autoworker:checkpoint firstRead subtask's "Acceptance Criteria" table, check whether each metric was measured in L1-L4 test results:
In subtask.md's "Confidence Assessment" section, fill in for each change point:
| Change point | Test level | Confidence | Verification method | Unverified/Risk |
|---|
Confidence inference basis:
When the table above has < 95% items:
Record in subtask.md's "< 95% Supplementary Verification" table.
Boundary for "requires user confirmation" — only scenarios depending on human senses qualify:
When all items are >= 95%, write "All >= 95%, no supplementary verification needed".
In subtask.md's "Gate 3 Self-Check" section, check each item and provide evidence:
Can't write evidence = didn't do it = can't check the box = FAIL.
Answer each question (any No → FAIL):
PASS (all self-checks pass + all >= 95%):
Edit subtask.md:
Gate result: PASS at the end of "Progress Log" sectionstatus: active to status: completed (if status field exists)Output:
Gate 3 PASS
- Confidence: all >= 95%
- Self-check: 5/5 passed
- Chain: complete
→ Invoking autoworker:dispatch
FAIL loop limit: If already consecutively FAIL 2 times (check "Progress Log" for Gate result: FAIL count), on the 3rd FAIL, do not invoke autoworker:subtask-update. Instead output a complete failure report to the user and let them decide next steps.
FAIL (any self-check fails or has < 95% that can't be self-resolved):
Edit subtask.md, append at end of "Progress Log" section:
Gate result: FAIL
Output:
Gate 3 FAIL
- Failed items: <specifics>
- Needs additional work: <specifics>
→ Invoking autoworker:dispatch
Both cases always invoke autoworker:dispatch. No routing decisions — autoworker:dispatch reads Gate result and decides next step.
After outputting the result, immediately invoke autoworker:dispatch. Do not wait for user instructions, do nothing else.
Gate result: PASS or Gate result: FAIL — autoworker:dispatch reads this exact format